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ADENO-ASSOCIATED VIRUS (AAV) SEROTYPE 8 SEQUENCES, VECTORS 
CONTAINING SAME, AND USES THEREFOR 

BACKGROUND OF THE INVENTION 
5 Adeno-associated virus (AAV), a member of the Parvovirus family, is a 

small nonenveloped, icosahedral virus with single-stranded linear DNA genomes of 4.7 
kilobases (kb) to 6 kb. AAV is assigned to the genus, Dependovirus, because the virus 
was discovered as a contaminant in purified adenovirus stocks. AAV's life cycle includes 
a latent phase at which AAV genomes, after infection, are site specifically integrated into 

10 host chromosomes and an infectious phase in which, following either adenovirus or 
herpes simplex virus infection, the integrated genomes are subsequently rescued, 
replicated, and packaged into infectious viruses. The properties of non-pathogenicity, 
broad host range of infectivity, including non-dividing cells, and potential site-specific 
chromosomal integration make AAV an attractive tool for gene transfer. 

15 Recent studies suggest that AAV vectors may be the preferred vehicle for 

gene delivery. To date, there have been 6 different serotypes of AAVs isolated from 
human or non-human primates (NHP) and well characterized. Among them, human 
serotype 2 is the first AAV that was developed as a gene transfer vector; it has been 
widely used for efficient gene transfer experiments in different target tissues and animal 

20 models. Clinical trials of the experimental application of AAV2 based vectors to some 
human disease models are in progress, and include such diseases as cystic fibrosis and 
hemophilia B. 

What are desirable are AAV-based constructs for gene delivery. 

25 SUMMARY OF THE INVENTION 

In one aspect, the invention provides novel AAV sequences, compositions 

containing these sequences, and uses therefor. Advantageously, these compositions are 

particularly well suited for use in compositions requiring re-administration of rAAV for 

therapeutic or prophylactic purposes. 
30 These and other aspects of the invention will be readily apparent from the 

following detailed description of the invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1A through 1C are the nucleic acid sequences of the rep and cap regions of 
AAV8 [SEQIDNO:l]. 

Figs. 2A through 2C are the amino acid sequences of the AAV8 capsid vpl 
5 protein [SEQ ID NO:2], provided in alignment with the vpl of the published sequences of 
AAV2 [SEQ ID NO:4], AAV1 [SEQ ID NO:5], and AAV3 [SEQ ID NO:6], and newly 
identified AAV serotypes AAV7 [SEQ ID NO: 8] and AAV9 [SEQ ID NO:7]. The 
alignment was performed using the Clustal W program, with the number of AAV2 used 
for reference. Underlining and bold at the bottom sequence of the alignment indicates 
10 cassettes of identity. The dots in the alignment indicate that the amino acids are missing 
at the positions in the alignment as compared to AAV2 VP1. 

Figs. 3 A through 3C are the amino acid sequences of the AAV8 rep proteins 
[SEQIDNO:3]. 

1 5 DETAILED DESCRIPTION OF THE INVENTION 

The invention provides the nucleic acid sequences and amino acids of a novel 
AAV serotype, AAV8. Also provided are fragments of these AAV sequences. Each of 
these fragments may be readily utilized in a variety of vector systems and host cells. 
Among desirable AAV8 fragments are the cap proteins, including the vpl, vp2, vp3 and 

20 hypervariable regions, the rep proteins, including rep 78, rep 68, rep 52, and rep 40, and 
the sequences encoding these proteins. These fragments may be readily utilized in a 
variety of vector systems and host cells. Such fragments may be used alone, in 
combination with other AAV8 sequences or fragments, or in combination with elements 
from other AAV or non-AAV viral sequences. In one particularly desirable embodiment, 

25 a vector contains the AAV8 cap and/or rep sequences of the invention. 

The AAV8 sequences and fragments thereof are useful in production of rAAV, 
and are also useful as antisense delivery vectors, gene therapy vectors, or vaccine vectors. 
The invention further provides nucleic acid molecules, gene delivery vectors, and host 
cells which contain the AAV8 sequences of the invention. 

30 Suitable fragments can be determined using the information provided herein. 

Alignments are performed using any of a variety of publicly or commercially available 
Multiple Sequence Alignment Programs, such as "Clustal W", accessible through Web 
Servers on the internet. Alternatively, Vector NTI utilities are also used. There are also a 
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number of algorithms known in the art which can be used to measure nucleotide sequence 
identity, including those contained in the programs described above. As another example, 
polynucleotide sequences can be compared using Fasta, a program in GCG Version 6.1. 
Fasta provides alignments and percent sequence identity of the regions of the best overlap 
5 between the query and search sequences. For instance, percent sequence identity 

between nucleic acid sequences can be determined using Fasta with its default parameters 
(a word size of 6 and the NOPAM factor for the scoring matrix) as provided in GCG 
Version 6.1, herein incorporated by reference. Similar programs are available for amino 
acid sequences, e.g., the "Clustal X" program. Generally, any of these programs are used 
10 at default settings, although one of skill in the art can alter these settings as needed. 
Alternatively, one of skill in the art can utilize another algorithm or computer program 
which provides at least the level of identity or alignment as that provided by the 
referenced algorithms and programs. 

The term "substantial homology" or "substantial similarity," when referring to a 
15 nucleic acid, or fragment thereof, indicates that, when optimally aligned with appropriate 
nucleotide insertions or deletions with another nucleic acid (or its complementary strand), 
there is nucleotide sequence identity in at least about 95 to 99% of the aligned sequences. 
Preferably, the homology is over full-length sequence, or an open reading frame thereof, 
or another suitable fragment which is at least 15 nucleotides in length. Examples of 
20 suitable fragments are described herein. 

The term "substantial homology" or "substantial similarity," when referring to 
amino acids or fragments thereof, indicates that, when optimally aligned with appropriate 
amino acid insertions or deletions with another amino acid (or its complementary strand), 
there is amino acid sequence identity in at least about 95 to 99% of the aligned sequences. 
25 Preferably, the homology is over full-length sequence, or a protein thereof, e.g., a cap 
protein, a rep protein, or a fragment thereof which is at least 8 amino acids, or more 
desirably, at least 15 amino acids in length. Examples of suitable fragments are described 
herein. 

By the term "highly conserved" is meant at least 80% identity, preferably at least 
30 90% identity, and more preferably, over 97% identity. Identity is readily determined by 
one of skill in the art by resort to algorithms and computer programs known by those of 
skill in the art. 
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The term "percent sequence identity" or "identical" in the context of nucleic 
acid sequences refers to the residues in the two sequences which are the same when 
aligned for maximum correspondence. The length of sequence identity comparison 
may be over the full-length of the genome, the full-length of a gene coding sequence, 
5 or a fragment of at least about 500 to 5000 nucleotides, is desired. However, identity 
among smaller fragments, e.g. of at least about nine nucleotides, usually at least about 
20 to 24 nucleotides, at least about 28 to 32 nucleotides, at least about 36 or more 
nucleotides, may also be desired. Similarly, "percent sequence identity" may be 
readily determined for amino acid sequences, over the full-length of a protein, or a 
10 fragment thereof. Suitably, a fragment is at least about 8 amino acids in length, and 
may be up to about 700 amino acids. Examples of suitable fragments are described 
herein. 

As described herein, the vectors of the invention containing the AAV capsid 
proteins of the invention are particularly well suited for use in applications in which the 
15 neutralizing antibodies diminish the effectiveness of other AAV serotype based vectors, 
as well as other viral vectors. The rAAV vectors of the invention are particularly 
. advantageous in rAAV readministration and repeat gene therapy. 

These and other embodiments and advantages of the invention are described in 
more detail below. As used throughout this specification and the claims, the term 
20 "comprising" is inclusive of other components, elements, integers, steps and the like. 
Conversely, the term "consisting" and its variants are exclusive of other components, 
elements, integers, steps and the like. 

I. AAV Serotype 8 Sequences 

25 A. Nucleic Acid Sequences 

The AAV8 nucleic acid sequences of the invention include the DNA 
sequences of Fig. 1 [SEQ ID NO: 1], which consists of 4396 nucleotides. The AAV8 
nucleic acid sequences of the invention further encompass the strand which is 
complementary to Fig. 1 [SEQ ID NO: 1], as well as the RNA and cDNA sequences 

30 corresponding to Fig. 1 [SEQ ID NO: 1] and its complementary strand. Also included in 
the nucleic acid sequences of the invention are natural variants and engineered 
modifications of Fig. 1 [SEQ ID NO: 1] and its complementary strand. Such 
modifications include, for example, labels which are known in the art, methylation, and 
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substitution of one or more of the naturally occurring nucleotides with a degenerate 
nucleotide. 

Further included in this invention are nucleic acid sequences which are 
greater than about 90%, more preferably at least about 95%, and most preferably at least 
5 about 98 to 99% identical or homologous to Fig. 1 [SEQ ID NO:l]. 

Also included within the invention are fragments of Fig. 1 [SEQ ID NO: 
1], its complementary strand, cDNA and RNA complementary thereto. Suitable 
fragments are at least 15 nucleotides in length, and encompass functional fragments, i.e., 
fragments which are of biological interest. Such fragments include the sequences 

10 encoding the three variable proteins (vp) of the AAV8 capsid which are alternative splice 
variants: vpl [nt 2121 to 4335 of Fig. 1, SEQ ID NO:l]; vp2 [nt 2532 to 4335 of Fig. 1, 
SEQ ID NO:l]; and vp 3 [nt 2730 to 4335 of Fig. 1, SEQ ID NO:l]. Other suitable 
fragments of Fig. 1 [SEQ ID NO:l], include the fragment which contains the start codon 
for the AAV8 capsid protein, and the fragments encoding the hypervariable regions of the 

15 vpl capsid protein, which are described herein, 

Still other fragments include those encoding the rep proteins, including 
rep 78 [initiation codon located at nt 227 of Fig. 1, SEQ ID NO:l], rep 68 [initiation 
codon located at nt 227 of Fig. 1, SEQ ID NO:l], rep 52 [initiation codon located at nt 
905 of Fig. 1, SEQ ID NO:l], and rep 40 [initiation codon located at nt 905 of Fig. 1, 

20 SEQ ID NO:l]. Other fragments of interest may include the AAV8 inverted terminal 
repeat which can be identified by the methods described herein, AAV PI 9 sequences, 
AAV8 P40 sequences, the rep binding site, and the terminal resolute site (TRS). Still 
other suitable fragments will be readily apparent to those of skill in the art. 

In addition to including the nucleic acid sequences 

25 provided in the figures and Sequence Listing, the present invention includes nucleic acid 
molecules and sequences which are designed to express the amino acid sequences, 
proteins and peptides of the AAV serotypes of the invention. Thus, the invention 
includes nucleic acid sequences which encode the following novel AAV amino acid 
sequences and artificial AAV serotypes generated using these sequences and/or unique 

3 0 fragments thereof 

As used herein, artificial AAV serotypes include, without limitation, AAV 
with a non-naturally occurring capsid protein. Such an artificial capsid may be generated 
by any suitable technique, using a novel AAV sequence of the invention (e.g., a fragment 
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of a vpl capsid protein) in combination with heterologous sequences which may be 
obtained from another AAV serotype (known or novel), non-contiguous portions of the 
same AAV serotype, from a non-AAV viral source, or from a non-viral source. An 
artificial AAV serotype may be, without limitation, a chimeric AAV capsid, a 
5 recombinant AAV capsid, or a "humanized" AAV capsid. 

B. AAV8 Amino Acid Sequences, Proteins and Peptides 

The invention further provides proteins and fragments thereof which are 
encoded by the AAV8 nucleic acids of the invention, and AAV8 amino acids which are 
generated by other methods. The invention further encompasses AAV serotypes 

1 0 generated using sequences of the novel AAV serotype of the invention, which are 

generated using synthetic, recombinant or other techniques known to those of skill in the 
art. The invention is not limited to novel AAV amino acid sequences, peptides and 
proteins expressed from the novel AAV nucleic acid sequences of the invention and 
encompasses amino acid sequences, peptides and proteins generated by other methods 

15 known in the art, including, e.g., by chemical synthesis, by other synthetic techniques, or 
by other methods. For example, the sequences of any of be readily generated using a 
variety of techniques. 

Suitable production techniques are well known to those of skill in the art. 
See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

20 Press (Cold Spring Harbor, NY). Alternatively, peptides can also be synthesized by the 
well known solid phase peptide synthesis methods (Merrifield, J. Am. Chem. Soc, 
85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis (Freeman, San 
Francisco, 1969) pp. 27-62). These and other suitable production methods are within the 
knowledge of those of skill in the art and are not a limitation of the present invention. 

25 Particularly desirable proteins include the AAV capsid proteins, which are 

encoded by the nucleotide sequences identified above. The AAV capsid is composed of 
three proteins, vpl, vp2 and vp3, which are alternative splice variants. The full-length 
sequence provided in figure 2 is that of vpl. The AAV8 capsid proteins include vpl [aa 1 
to 737 of SEQ ID NO:2], vp2 [aa 138 to 737 of SEQ ID NO:2], and vp3 [aa 203 to 737 of 

30 SEQ ID NO: 2] and functional fragments thereof. Other desirable fragments of the capsid 
protein include the constant and variable regions, located between hypervariable regions 
(HPV). Other desirable fragments of the capsid protein include the HPV themselves. 
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An algorithm developed to determine areas of sequence divergence in 
AAV2 has yielded 12 hypervariable regions (HVR) of which 5 overlap or are part of the 
four previously described variable regions. [Chiorini et al 9 1 Virol, 73:1309-19 (1999); 
Rutledge etal, J. Virol, 72:309-319] Using this algorithm and/or the alignment 
5 techniques described herein, the HVR of the novel AAV serotypes are determined. For 
example, with respect to the number of the AAV2 vpl [SEQ ID NO:4], the HVR are 
located as follows: HVR1, aa 146-152; HVR2, aa 182-186; HVR3, aa 262-264; HVR4 5 
aa 381-383; HVR5, aa 450-474; HVR6, aa 490-495; HVR7, aa500-504; HVR8, aa 514- 
522; HVR9, aa 534-555; HVR10, aa 581-594; HVR1 1, aa 658-667; and HVR12, aa 705- 
10 719. Using the alignment provided herein performed using the Clustal X program at 

default settings, or using other commercially or publicly available alignment programs at 
default settings, one of skill in the art can readily determine corresponding fragments of 
the novel AAV capsids of the invention. 

Still other desirable fragments of the AAV8 capsid protein include 
15 amino acids 1 to 184 of SEQ ID NO: 2, amino acids 199 to 259; amino acids 274 to 
446; amino acids 603 to 659; amino acids 670 to 706; amino acids 724 to 736 of SEQ 
ID NO:2; aa 185 - 198; aa 260-273; aa447-477; aa495-602; aa660-669; and aa707- 
723. Additionally, examples of other suitable fragments of AAV capsids include, with 
respect to the numbering of AAV2 [SEQ ID NO:4], aa 24 - 42, aa 25 - 28; aa 81 - 85; 
20 aal33-165; aa 134 - 165; aa 137-143; aa 154-156; aa 194-208; aa 261-274; aa 262-274; 
aa 171-173; aa 413-417; aa 449-478; aa 494-525; aa 534-571; aa 581-601; aa 660-671; aa 
709-723. Still other desirable fragments include, for example, in AAV7, amino acids 1 to 
184 of SEQ ID NO:2, amino acids 199 to 259; amino acids 274 to 446; amino acids 603 
to 659; amino acids 670 to 706; amino acids 724 to 736; aa 185 to 198; aa 260 to 273; 
25 aa447 to 477; aa495 to 602; aa660 to 669; and aa707 to 723. Using the alignment 

provided herein performed using the .Clustal X program at default settings, or using other 
commercially or publicly available alignment programs at default settings, one of skill in 
the art can readily determine corresponding fragments of the novel AAV capsids of the 
invention. 

30 Still other desirable AAV8 proteins include the rep proteins include 

rep68/78 and rep40/52 [located within aa 1 to 625 of SEQ ID NO: 3], Suitable fragments 
of the rep proteins may include aa 1 to 102; aa 103 to 140; aa 141 to 173; aa 174 to 226; 
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aa 227 to 275; aa 276 to 374; aa 375 to 383; aa 384 to 446; aa 447 to 542; aa 543 to 555; 
aa 556 to 625 5 of SEQ ID NO: 3. 

Suitably, fragments are at least 8 amino acids in length. However, 
fragments of other desired lengths may be readily utilized. Such fragments may be 
5 produced recombinantly or by other suitable means, e.g., chemical synthesis. 

The invention further provides other AAV8 sequences which are 
identified using the sequence information provided herein. For example, given the AAV8 
sequences provided herein, infectious AAV8 may be isolated using genome walking 
technology (Siebert etal, 1995, Nucleic Acid Research, 23:1087-1088, Friezner-Degen et 

10 al, 1986, J. Biol Chem. 261:6972-6985, BD Biosciences Clontech, Palo Alto, CA). 
Genome walking is particularly well suited for identifying and isolating the sequences 
adjacent to the novel sequences identified according to the method of the invention. This 
technique is also useful for isolating inverted terminal repeat (ITRs) of the novel AAV8 
serotype, based upon the novel AAV capsid and rep sequences provided herein. 

15 The sequences, proteins, and fragments of the invention may be produced 

by any suitable means, including recombinant production, chemical synthesis, or other 
synthetic means. Such production methods are within the knowledge of those of skill in 
the art and are not a limitation of the present invention. 

20 IV. Production of r AAV with AA V8 Capsids 

The invention encompasses novel, wild-type AAV8, the sequences of which are 
free of DNA and/or cellular material with these viruses are associated in nature. In 
another aspect, the present invention provides molecules which utilize the novel AAV 
sequences of the invention, including fragments thereof, for production of molecules 

25 useful in delivery of a heterologous gene or other nucleic acid sequences to a target cell. 

In another aspect, the present invention provides molecules which utilize the 
AAV8 sequences of the invention, including fragments thereof, for production of viral 
vectors useful in delivery of a heterologous gene or other nucleic acid sequences to a 
target cell. 

30 The molecules of the invention which contain AAV8 sequences include any 

genetic element (vector) which may be delivered to a host cell, e.g., naked DNA, a 
plasmid, phage, transposon, cosmid, episome, a protein in a non-viral delivery vehicle 
(e.g., a lipid-based carrier), virus, etc. which transfer the sequences carried thereon. 
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The selected vector may be delivered by any suitable method, including transfection, 
electroporation, liposome delivery, membrane fusion techniques, high velocity DNA- 
coated pellets, viral infection and protoplast fusion. The methods used to construct any 
embodiment of this invention are known to those with skill in nucleic acid manipulation 
5 and include genetic engineering, recombinant engineering, and synthetic techniques. See, 
e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Press, Cold Spring Harbor, NY. 

In one embodiment, the vectors of the invention contain, at a minimum, 
sequences encoding an AAV8 capsid or a fragment thereof. In another embodiment, the 
10 vectors of the invention contain, at a minimum, sequences encoding an AAV8 rep protein 
or a fragment thereof. Optionally, such vectors may contain both AAV cap and rep 
proteins. In vectors in which both AAV rep and cap are provides, the AAV rep and AAV 
cap sequences can both be of AAV8 origin. Alternatively, the present invention provides 
vectors in which the rep sequences are from an AAV serotype which differs from that 
15 which is providing the cap sequences. In one embodiment, the rep and cap sequences are 
expressed from separate sources (e.g., separate vectors, or a host cell and a vector). In 
another embodiment, these rep sequences are fused in frame to cap sequences of a 
different AAV serotype to form a chimeric AAV vector. Optionally, the vectors of the 
invention further contain a minigene comprising a selected transgene which is flanked by 
20 AAV 5' ITR and AAV 3' ITR. 

Thus, in one embodiment, the vectors described herein contain nucleic acid 
sequences encoding an intact AAV capsid which may be from a single AAV serotype 
(e.g., AAV8). Such a capsid may comprise amino acids 1 to 738 of SEQ ID NO:2. 
Alternatively, these vectors contain sequences encoding artificial capsids which contain 
25 one or more fragments of the AAV8 capsid fused to heterologous AAV or non-AAV 

capsid proteins (or fragments thereof). These artificial capsid proteins are selected from 
non-contiguous portions of the AAV8 capsid or from capsids of other AAV serotypes. 
For example, a rAAV may have a capsid protein comprising one or more of the AAV8 
capsid regions selected from the vp2 and/or vp3, or from vp 1, or fragments thereof 
30 selected from amino acids 1 to 184, amino acids 199 to 259; amino acids 274 to 446; 
amino acids 603 to 659; amino acids 670 to 706; amino acids 724 to 738 of the AAV8 
capsid, SEQ ID NO: 2. In another example, it may be desirable to alter the start codon of 
the vp3 protein to GTG. Alternatively, the rAAV may contain one or more of the AAV 
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serotype 8 capsid protein hypervariable regions which are identified herein, or other 
fragment including, without limitation, aa 185 - 198; aa 260-273; aa447~477; aa495-602; 
aa660-669; and aa707-723 of the AAV8 capsid. See, SEQ ID NO: 2. These 
modifications may be to increase expression, yield, and/or to improve purification in the 
5 selected expression systems, or for another desired purpose (e.g., to change tropism or 
alter neutralizing antibody epitopes). 

The vectors described herein, e.g., a plasmid, are useful for a variety of purposes, 
but are particularly well suited for use in production of a rAAV containing a capsid 
comprising AAV sequences or a fragment thereof. These vectors, including rAAV, their 
10 elements, construction, and uses are described in detail herein. 

In one aspect, the invention provides a method of generating a recombinant 
adeno-associated virus (AAV) having an AAV serotype 8 capsid, or a portion thereof. 
Such a method involves culturing a host cell which contains a nucleic acid sequence 
encoding an adeno-associated virus (AAV) serotype 8 capsid protein, or fragment thereof, 
15 as defined herein; a functional rep gene; a minigene composed of, at a minimum, AAV 
inverted terminal repeats (ITRs) and a transgene; and sufficient helper functions to permit 
packaging of the minigene into the AAV8 capsid protein. 

The components required to be cultured in the host cell to package an AAV 
minigene in an AAV capsid may be provided to the host cell in trans. Alternatively, any 
20 one or more of the required components (e.g., minigene, rep sequences, cap sequences, 
and/or helper functions) may be provided by a stable host cell which has been engineered 
to contain one or more of the required components using methods known to those of skill 
in the art. Most suitably, such a stable host cell will contain the required component(s) 
under the control of an inducible promoter. However, the required component(s) may be 
25 under the control of a constitutive promoter. Examples of suitable inducible and 
constitutive promoters are provided herein, in the discussion of regulatory elements 
suitable for use with the transgene. In still another alternative, a selected stable host cell 
may contain selected component(s) under the control of a constitutive promoter and other 
selected component(s) under the control of one or more inducible promoters. For 
30 example, a stable host cell may be generated which is derived from 293 cells (which 
contain El helper functions under the control of a constitutive promoter), but which 
contains the rep and/or cap proteins under the control of inducible promoters. Still other 
stable host cells may be generated by one of skill in the art. 
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The minigene, rep sequences, cap sequences, and helper functions required for 
producing the rAAV of the invention may be delivered to the packaging host cell in the 
form of any genetic element which transfer the sequences carried thereon. The selected 
genetic element may be delivered by any suitable method, including those described 
5— herein. The methods used to construct any embodiment-of this invention are known to 
those with skill in nucleic acid manipulation and include genetic engineering, 
recombinant engineering, and synthetic techniques. See, e.g., Sambrook et al, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY. 
Similarly, methods of generating rAAV virions are well known and the selection of a 
10 suitable method is not a limitation on the present invention. See, e.g., K. Fisher et al, J. 
Virol, 70:520-532 (1993) and US Patent 5,478,745. 

Unless otherwise specified, the AAV ITRs, and other selected AAV components 
described herein, may be readily selected from among any AAV serotype, including, 
without limitation, AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV9 and the 
15 novel serotype of the invention, AAV8. These ITRs or other AAV components may be 
readily isolated using techniques available to those of skill in the art from an AAV 
serotype. Such AAV may be isolated or obtained from academic, commercial, or public 
sources (e.g., the American Type Culture Collection, Manassas, VA). Alternatively, the 
AAV sequences may be obtained through synthetic or other suitable means by reference 
20 to published sequences such as are available in the literature or in databases such as, e.g., 
GenBank, PubMed, or the like. 

A. The Minigene 

The minigene is composed of, at a minimum, a transgene and its 
regulatory sequences, and 5' and 3' AAV inverted terminal repeats (ITRs). In one 
25 desirable embodiment, the ITRs of AAV serotype 2 are used. However, ITRs from other 
suitable serotypes may be selected. It is this minigene which is packaged into a capsid 
protein and delivered to a selected host cell. 

1 . The transgene 

The transgene is a nucleic acid sequence, heterologous to 
30 the vector sequences flanking the transgene, which encodes a polypeptide, protein, or 
other product, of interest. The nucleic acid coding sequence is operatively linked to 
regulatory components in a manner which permits transgene transcription, translation, 
and/or expression in a host cell. 
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The composition of the transgene sequence will depend 
upon the use to which the resulting vector will be put. For example, one type of 
transgene sequence includes a reporter sequence, which upon expression, produces a 
detectable signal. Such reporter sequences include, without limitation, DNA sequences 
5 encoding P-lactamase, P-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, 
green fluorescent protein (GFP), chloramphenicol acetyltransferase (CAT), luciferase, 
membrane bound proteins including, for example, CD2, CD4, CDS, the influenza 
hemagglutinin protein, and others well known in the art, to which high affinity antibodies 
directed thereto exist or can be produced by conventional means, and fusion proteins 

10 comprising a membrane bound protein appropriately fused to an antigen tag domain from, 
among others, hemagglutinin or Myc. 

These coding sequences, when associated with regulatory 
elements which drive their expression, provide signals detectable by conventional means, 
including enzymatic, radiographic, colorimetric, fluorescence or other spectrograph^ 

15 assays, fluorescent activating cell sorting assays and immunological assays, including 
enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and 
immunohistochemistry. For example, where the marker sequence is the LacZ gene, the 
presence of the vector carrying the signal is detected by assays for beta-galactosidase 
activity. Where the transgene is green fluorescent protein or luciferase, the vector 

20 carrying the signal may be measured visually by color or light production in a 
luminometer. 

However, desirably, the transgene is a non-marker 
sequence encoding a product which is useful in biology and medicine, such as proteins, 
peptides, RNA, enzymes, or catalytic RNAs. Desirable RNA molecules include tRNA, 
25 dsRNA, ribosomal RNA, catalytic RNAs, and antisense RNAs. One example of a useful 
RNA sequence is a sequence which extinguishes expression of a targeted nucleic acid 
sequence in the treated animal. 

The transgene may be used to correct or ameliorate gene 
deficiencies, which may include deficiencies in which normal genes are expressed at less 
30 than normal levels or deficiencies in which the functional gene product is not expressed. 
A preferred type of transgene sequence encodes a therapeutic protein or polypeptide 
which is expressed in a host cell. The invention further includes using multiple 
transgenes, e.g., to correct or ameliorate a gene defect caused by a multi-subunit protein. 
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In certain situations, a different transgene may be used to encode each subunit of a 
protein, or to encode different peptides or proteins. This is desirable when the size of the 
DNA encoding the protein subunit is large, e.g., for an immunoglobulin, the platelet- 
derived growth factor, or a dystrophin protein. In order for the cell to produce the multi- 
5 subunit protein, a cell is infected with the recombinant virus containing each of the 
different subunits. Alternatively, different subunits of a protein may be encoded by the 
same transgene. In this case, a single transgene includes the DNA encoding each of the 
subunits, with the DNA for each subunit separated by an internal ribozyme entry site 
(IRES). This is desirable when the size of the DNA encoding each of the subunits is 

10 small, e.g., the total size of the DNA encoding the subunits and the IRES is less than five 
kilobases. As an alternative to an IRES, the DNA may be separated by sequences 
encoding a 2A peptide, which self-cleaves in a post-translational event. See, e.g., M.L. 
Donnelly, etal,J. Gen. Virol, 78(Pt 1):13-21 (Jan 1997); Furler, S, etal, Gene Ther., 
8(ll):864-873 (June 2001); Klump H, etai, Gene Ther., 8(10):81 1-817 (May 2001). 

15 This 2A peptide is significantly smaller than an IRES, making it well suited for use when 
space is a limiting factor. However, the selected transgene may encode any biologically 
active product or other product, e.g., a product desirable for study. 

Suitable transgenes may be readily selected by one of skill 
in the art. The selection of the transgene is not considered to be a limitation of this 
20 invention. 

2. . Regulatory Elements 

In addition to the major elements identified above for the 
minigene, the vector also includes conventional control elements which are operably 
linked to the transgene in a manner which permits its transcription, translation and/or 
25 expression in a cell transfected with the plasmid vector or infected with the virus 

produced by the invention. As used herein, "operably linked" sequences include both 
expression control sequences that are contiguous with the gene of interest and expression 
control sequences that act in trans or at a distance to. control the gene of interest. 

Expression control sequences include appropriate transcription 
30 initiation, termination, promoter and enhancer sequences; efficient RNA processing 
signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize 
cytoplasmic mRNA; sequences that enhance translation efficiency (i.e., Kozak consensus 
sequence); sequences that enhance protein stability; and when desired, sequences that 
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enhance secretion of the encoded product. A great number of expression control 
sequences, including promoters which are native, constitutive, inducible and/or tissue- 
specific, are known in the art and may be utilized. 

Examples of constitutive promoters include, without 
—5 limitation, the retroviral Rous sarcoma virus (RSV)-LTR promoter (optionally with the 
RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV 
enhancer) [see, e.g., Boshart et al, Cell, 41:521-530 (1985)], the SV40 promoter, the 
dihydrofolate reductase promoter, the p-actin promoter, the phosphoglycerol kinase 
(PGK) promoter, and the EF1 promoter [Invitrogen]. Inducible promoters allow 
1 0 regulation of gene expression and can be regulated by exogenously supplied 

compounds, environmental factors such as temperature, or the presence of a specific 
physiological state, e.g., acute phase, a particular differentiation state of the cell, or in 
replicating cells only. Inducible promoters and inducible systems are available from 
a variety of commercial sources, including, without limitation, Invitrogen, Clontech 
1 5 and Ariad. Many other systems have been described and can be readily selected by 
one of skill in the ait. Examples of inducible promoters regulated by exogenously 
supplied compounds, include, the zinc-inducible sheep metallothionine (MT) promoter, 
the dexamethasone (Dex)-inducible mouse mammary tumor virus (MMTV) promoter, the 
T7 polymerase promoter system [WO 98/10088]; the ecdysone insect promoter [No et al, 
20 Proc. Natl Acad Set USA, 93:3346-3351 (1996)], the tetracycline-repressible system 
[Gossen etal, Proa Natl Acad Sci USA, 89:5547-5551 (1992)], the tetracycline- 
inducible system [Gossen etal, Science, 268:1766-1769 (1995), see also Harvey etal, 
Cwr. Opin. Chem. Biol, 2:512-518 (1998)], the RU486-inducible system [Wang et al, 
Nat. Biotech, 15:239-243 (1997) and Wang et al, Gene Ther., 4:432-441 (1997)] and the 
25 rapamycin-inducible system [Magari etal, J. Clin, Invest., 100:2865-2872 (1997)]. Other 
types of inducible promoters which may be useful in this context are those which are 
regulated by a specific physiological state, e.g., temperature, acute phase, a particular 
differentiation state of the cell, or in replicating cells only. 

In another embodiment, the native-promoter for the 
30 transgene will be used. The native promoter may be preferred when it is desired that 
expression of the transgene should mimic the native expression. The native promoter 
may be used when expression of the transgene must be regulated temporally or 
developmental ly, or in a tissue-specific manner, or in response to specific transcriptional 
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stimuli. In a further embodiment, other native expression control elements, such as 
enhancer elements, polyadenylation sites or Kozak consensus sequences may also be used 
to mimic the native expression. 

Another embodiment of the transgene includes a gene 
5 operably linked to a-tissue-specific promoter. For instance, if expression in skeletal 
muscle is desired, a promoter active in muscle should be used. These include the 
promoters from genes encoding skeletal P-actin, myosin light chain 2A, dystrophin, 
muscle creatine kinase, as well as synthetic muscle promoters with activities higher than 
naturally-occurring promoters (see Li et al, Nat. Biotech, 17:241-245 (1999)). Examples 
10 of promoters that are tissue-specific are known for liver (albumin, Miyatake et al, J. 
Virol, 71:5124-32 (1997); hepatitis B virus core promoter, Sandig et al, Gene Ther., 
3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnotef <rf., Hum. Gene Ther., 7:1503-14 
(1996)), bone osteocalcin (Stein etal, Mol Biol Rep,, 24:185-96 (1997)); bone 
sialoprotein (Chen et al, J. Bone Miner. Res., 11:654-64 (1996)), lymphocytes (CD2, 

15 Hansal et al,l Immunol, 161:1063-8 (1998); immunoglobulin heavy chain; T cell 

receptor chain), neuronal such as neuron-specific enolase (NSE) promoter (Andersen et 
al, Cell Mol Neurobiol, 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et 
al, Proc. Natl Acad. Sci. USA, 88:561 1-5 (1991)), and the neuron-specific vgf gene 
(Piccioli et al, Neuron, 15:373-84 (1995)), among others. 

20 Optionally, plasmids carrying therapeutically useful 

transgenes may also include selectable markers or reporter genes may include sequences 
encoding geneticin, hygromicin orpurimycin resistance, among others. Such selectable 
reporters or marker genes (preferably located outside the viral genome to be rescued by 
the method of the invention) can be used to signal the presence of the plasmids in 

25 bacterial cells, such as ampicillin resistance. Other components of the plasmid may 
include an origin of replication. Selection of these and other promoters and vector 
elements are conventional and many such sequences are available [see, e.g., Sambrook et 
al, and references cited therein]. 

The combination of the transgene, promoter/enhancer, and 

30 5' and 3 5 ITRs is referred to as a "minigene" for ease of reference herein. Provided with 
the teachings of this invention, the design of such a minigene can be made by resort to 
conventional techniques. 
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3. Delivery of the Minigene to a Packaging Host Cell 

The minigene can be carried on any suitable vector, e.g., a 
plasmid, which is delivered to a host cell. The plasmids useful in this invention may be 
engineered such that they are suitable for replication and, optionally, integration in 
5 prokaryotic cells, mammalian cells, orboth. These plasmids (or other vectors carrying 
the 5' AAV ITR-heterologous molecule-3'ITR) contain sequences permitting replication 
of the minigene in eukaryotes and/or prokaryotes and selection markers for these systems. 
Selectable markers or reporter genes may include sequences encoding geneticin, 
hygromicin or purimycin resistance, among others. The plasmids may also contain 

10 certain selectable reporters or marker genes that can be used to signal the presence of the 
vector in bacterial cells, such as ampicillin resistance. Other components of the plasmid 
may include an origin of replication and an amplicon, such as the amplicon system 
employing the Epstein Barr virus nuclear antigen. This amplicon system, or other similar 
amplicon components permit high copy episomal replication in the cells. Preferably, the 

15 molecule carrying the minigene is transfected into the cell, where it may exist transiently. 
Alternatively, the minigene (carrying the 5' AAV ITR-heterologous molecule-3' ITR) 
may be stably integrated into the genome of the host cell, either chromosomally or as an 
episome. In certain embodiments, the minigene may be present .in multiple copies, 
optionally in head-to-head, head-to-tail, or tail-to-tail concatamers. Suitable transfection 

20 techniques are known and may readily be utilized to deliver the minigene to the host cell. 

Generally, when delivering the vector comprising the minigene by 
transfection, the vector is delivered in an amount from about 5 \ig to about 100 |ig DNA, 
about 10 to about 50 jag DNA to about 1 x 10 4 cells to about 1 x 10 13 cells, or about 10 5 
cells. However, the relative amounts of vector DNA to host cells may be adjusted, taking 

25 into consideration such factors as the selected vector, the delivery method and the host 
cells selected. 

B. Rep and Cap Sequences 

In addition to the minigene, the host cell contains the sequences 
which drive expression of the AAV8 capsid protein (or a capsid protein comprising a 
30 fragment of the AAV8 capsid) in the host cell and rep sequences of the same serotype as 
the serotype of the AAV ITRs found in the minigene, or a cross-complementing serotype. 
The AAV cap and rep sequences may be independently obtained from an AAV source as 
described above and may be introduced into the host cell in any manner known to one in 
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the art as described above. Additionally, when pseudotyping an AAV vector in an AAV8 
capsid, the sequences encoding each of the essential rep proteins may be supplied by 
AAV8, or the sequences encoding the rep proteins may be supplied by different AAV 
serotypes (e.g., AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV9). For 
5 _ example, the repW6% sequences may be from AAV2, whereas the repS2/40 sequences 
may be from AAV8. 

In one embodiment, the host cell stably contains the capsid protein 
under the control of a suitable promoter, such as those described above. Most desirably, 
in this embodiment, the capsid protein is expressed under the control of an inducible 
10 promoter. In another embodiment, the capsid protein is supplied to the host cell in trans. 
When delivered to the host cell in trans, the capsid protein may be delivered via a 
plasmid which contains the sequences necessary to direct expression of the selected 
capsid protein in the host cell. Most desirably, when delivered to the host cell in trans, 
the plasmid carrying the capsid protein also carries other sequences required for 
1 5 packaging the rAAV, e.g., the rep sequences. 

In another embodiment, the host cell stably contains the rep 
sequences under the control of a suitable promoter, such as those described above. Most 
desirably, in this embodiment, the essential rep proteins are expressed under the control 
of an inducible promoter. In another embodiment, the rep proteins are supplied to the 
20 host cell in trans. When delivered to the host cell in trans 9 the rep proteins may be 

delivered via a plasmid which contains the sequences necessary to direct expression of 
the selected rep proteins in the host cell. Most desirably, when delivered to the host cell 
in trans, the plasmid carrying the capsid protein also carries other sequences required for 
packaging the rAAV, e.g., the rep and cap sequences. 
25 Thus, in one embodiment, the rep and cap sequences may be 

transfected into the host cell on a single nucleic acid molecule and exist stably in the cell 
as an episome. In another embodiment, the rep and cap sequences are stably integrated 
into the chromosome of the cell. Another embodiment has the rep and cap sequences 
transiently expressed in the host cell. For example, a useful nucleic acid molecule for 
30 such transfection comprises, from 5' to 3', a promoter, an optional spacer interposed 
between the promoter and the start site of the rep gene sequence, an AAV rep gene 
sequence, and an AAV cap gene sequence. 
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Optionally, the rep and/or cap sequences may be supplied on a 
vector that contains other DNA sequences that are to be introduced into the host cells. 
For instance, the vector may contain the rAAV construct comprising the minigene. The 
vector may comprise one or more of the genes encoding the helper functions, e.g., the 
5 adenoviral proteins El, E2a, and E40RF6, and the gene for VAI-RNA. 

Preferably, the promoter used in this construct may be any of the 
constitutive, inducible or native promoters known to one of skill in the art or as discussed 
above. In one embodiment, an AAV P5 promoter sequence is employed. The selection 
of the AAV to provide any of these sequences does not limit the invention. 
10 In another preferred embodiment, the promoter for rep is an 

inducible promoter, such as are discussed above in connection with the transgene 
regulatory elements. One preferred promoter for rep expression is the T7 promoter. The 
vector comprising the rep gene regulated by the T7 promoter and the cap gene, is 
transfected or transformed into a cell which either constitutively or inducibly expresses 
15 the T7 polymerase. See WO 98/10088, published March 12, 1998. 

The spacer is an optional element in the design of the vector. The 
spacer is a DNA sequence interposed between the promoter and the rep gene ATG start 
site. The spacer may have any desired design; that is, it may be a random sequence of 
nucleotides, or alternatively, it may encode a gene product, such as a marker gene. The 
20 spacer may contain genes which typically incorporate start/stop and polyA sites. The 
spacer may be a non-coding DNA sequence from a prokaryote or eukaryote, a repetitive 
non-coding sequence, a coding sequence without transcriptional controls or a coding 
sequence with transcriptional controls. Two exemplary sources of spacer sequences are 
the phage ladder sequences or yeast ladder sequences, which are available commercially, 
25 e.g., from Gibco or Invitrogen, among others. The spacer may be of any size sufficient to 
reduce expression of the rep7S and rep68 gene products, leaving the rep52> rep40 and 
cap gene products expressed at normal levels. The length of the spacer may therefore 
range from about 10 bp to about 10.0 kbp, preferably in the range of about 100 bp to 
about 8.0 kbp. To reduce the possibility of recombination, the spacer is preferably less 
30 than 2 kbp in length; however, the invention is not so limited. 

Although the molecule(s) providing rep and cap may exist in the 
host cell transiently (i.e., through transfection), it is preferred that one or both of the rep 
and cap proteins and the promoter(s) controlling their expression be stably expressed in 



18 



WO 03/052051 



PCT/US02/33630 



the host cell, e.g., as an episome or by integration into the chromosome of the host cell. 
The methods employed for constructing embodiments of this invention are conventional 
genetic engineering or recombinant engineering techniques such as those described in the 
references above. While this specification provides illustrative examples of specific 
-5*" constructs, using the information provided herein, one-of skill in the art may select and 
design other suitable constructs, using a choice of spacers, P5 promoters, and other 
elements, including at least one translational start and stop signal, and the optional 
addition of polyadenylation sites. 

In another embodiment of this invention, the rep or cap protein 
10 may be provided stably by a host cell. 

C. The Helper Functions 

The packaging host cell also requires helper functions in order to 
package the rAAV of the invention. Optionally, these functions may be supplied by a 
herpesvirus. Most desirably, the necessary helper functions are each provided from a 

15 human or non-human primate adenovirus source, such as those described above and/or 
are available from a variety of sources, including the American Type Culture Collection 
(ATCC), Manassas, VA (US). In one currently preferred embodiment, the host cell is 
provided with and/or contains an Ela gene product, an Elb gene product, an E2a gene 
product, and/or an E4 ORf 6 gene product. The host cell may contain other adenoviral 

20 genes such as VAI RNA, but these genes are not required. In a preferred embodiment, no 
other adenovirus genes or gene functions are present in the host cell. 

By "adenoviral DNA which-expresses the Ela gene product", it is 
meant any adenovirus sequence encoding Ela or any functional Ela portion. Adenoviral 
DNA which expresses the E2a gene product and adenoviral DNA which expresses the E4 

25 ORF6 gene products are defined similarly. Also included are any alleles or other 

modifications of the adenoviral gene or functional portion thereof. Such modifications 
may be deliberately introduced by resort to conventional genetic engineering or 
mutagenic techniques to enhance the adenoviral function in some manner, as well as 
naturally occurring allelic variants thereof. Such modifications and methods for 

30 manipulating DNA to achieve these adenovirus gene functions are known to those of skill 
in the art. 

The adenovirus Ela, Elb, E2a, and/or E40RF6 gene products, as 
well as any other desired helper functions, can be provided using any means that allows 
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their expression in a cell. Each of the sequences encoding these products may be on a 
separate vector, or one or more genes may be on the same vector. The vector may be any 
vector known in the art or disclosed above, including plasmids, cosmids and viruses. 
Introduction into the host cell of the vector may be achieved by any means known in the 
— 5 art or as disclosed above, including transfection, infection, electroporation, liposome 

delivery, membrane fusion techniques, high velocity DNA-coated pellets, viral infection 
and protoplast fusion, among others. One or more of the adenoviral genes may be stably 
integrated into the genome of the host cell, stably expressed as episomes, or expressed 
transiently. The gene products may all be expressed transiently, on an episome or stably 

10 integrated, or some of the gene products may be expressed stably while others are 

expressed transiently. Furthermore, the promoters for each of the adenoviral genes may 
be selected independently from a constitutive promoter, an inducible promoter or a native 
adenoviral promoter. The promoters may be regulated by a specific physiological state of . 
the organism or cell (i.e., by the differentiation state or in replicating or quiescent cells) or 

15 by exogenously added factors, for example. 

D. Host Cells And Packaging Cell Lines 

The host cell itself may be selected from any biological organism, 
including prokaryotic (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, 
yeast cells and mammalian cells. Particularly desirable host cells are selected from 

20 among any mammalian species, including, without limitation, cells such as A549, WEHI, 
3T3, 10T1/2, BHK, MDCK, COS 1, COS 7, BSC 1, BSC 40, BMT 10, VERO, WI38, 
HeLa, 293 cells (which express functional adenoviral El), Saos, C2C12, L cells, HT1080, 
HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals 
including human, monkey, mouse, rat, rabbit, and hamster. The selection of the 

25 mammalian species providing the cells is not a limitation of this invention; nor is the type 
of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc. The requirements for the 
cell used is that it not carry any adenovirus gene other than El, E2a and/or E4 ORF6; it 
not contain any other virus gene which could result in homologous recombination of a 
contaminating virus during the production of rAAV; and it is capable of infection or 

30 transfection of DNA and expression of the transfected DNA. In a preferred embodiment, 
the host cell is one that has rep and cap stably transfected in the cell. 

One host cell useful in the present invention is a host cell stably 
transformed with the sequences encoding rep and cap, and which is transfected with the 
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adenovirus El, E2a, and E40RF6 DNA and a construct carrying the minigene as 
described above. Stable rep and/or cap expressing cell lines, such as B-50 
(PCT/US98/19463), or those described in U.S. Patent No. 5,658,785, may also be 
similarly employed. Another desirable host cell contains the minimum adenoviral DNA 
5 which is sufficient to express E4 ORF6. Yet other cell lines can be constructed using-the 
AAV8 rep and/or AAV8 cap sequences of the invention. 

The preparation of a host cell according to this invention involves 
techniques such as assembly of selected DNA sequences. This assembly may be 
accomplished utilizing conventional techniques. Such techniques include cDNA and 
10 genomic cloning, which are well known and are described in Sambrook et al,, cited 
above, use of overlapping oligonucleotide sequences of the adenovirus and AAV 
genomes, combined with polymerase chain reaction, synthetic methods, and any other 
suitable methods which provide the desired nucleotide sequence. 

Introduction of the molecules (as plasmids or viruses) into the host 
15 cell may also be accomplished using techniques known to the skilled artisan and as 

discussed throughout the specification. In preferred embodiment, standard transfection 
techniques are used, e.g., CaP04 transfection or electroporation, and/or infection by 
hybrid adenovirus/ AAV vectors into cell lines such as the human embryonic kidney cell 
line HEK 293 (a human kidney cell line containing functional adenovirus El genes which 
20 provides tirms-actmg El proteins). 

The AAV8 based vectors which are generated by one of skill in the art are 
beneficial for gene delivery to selected host cells and gene therapy patients since no 
neutralization antibodies to AAV8 have been found in the human population. One of 
skill in the art may readily prepare other rAAV viral vectors containing the AAV8 capsid 
25 proteins provided herein using a variety of techniques known to those of skill in the art. 
One may similarly prepare still other rAAV viral vectors containing AAV8 sequence and 
AAV capsids of another serotype. 

One of skill in the art will readily understand that the AAV8 sequences of 
the invention can be readily adapted for use in these and other viral vector systems for in 
30 vitro, ex vivo or in vivo gene delivery. Similarly, one of skill in the art can readily select 
other fragments of the AAV8 genome of the invention for use in a variety of rAAV and 
non-rAAV vector systems. Such vectors systems may include, e.g., lentiviruses, 
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retroviruses, poxviruses, vaccinia viruses, and adenoviral systems, among others. 
Selection of these vector systems is not a limitation of the present invention. 

Thus, the invention further provides vectors generated using the nucleic 
acid and amino acid sequences of the novel AAV of the invention. Such vectors are 
5 useful for a variety of purposes, including fordelivery of therapeutic molecules and for 
use in vaccine regimens. Particularly desirable for delivery of therapeutic molecules are 
recombinant AAV containing capsids of the novel AAV of the invention. These, or other 
vector constructs containing novel AAV sequences of the invention may be used in 
vaccine regimens, e.g., for co-delivery of a cytokine, or for delivery of the immunogen 
10 itself. 

V. Recombinant Viruses And Uses Therefor 

Using the techniques described herein, one of skill in the art can generate a rAAV 
having a capsid of a serotype 8 of the invention or having a capsid containing one or more 

15 fragments of AAVS. In one embodiment, a full-length capsid from a single serotype, e.g., 
AAV8 [SEQ ID NO: 2] can be utilized. In another embodiment, a full-length capsid may 
be generated which contains one or more fragments of AAVS fused in frame with 
sequences from another selected AAV serotype, or from heterologous portions of AAVS. 
For example, a rAAV may contain one or more of the novel hypervariable region 

20 sequences of AAVS. Alternatively, the unique AAVS sequences of the invention may be 
used in constructs containing other viral or non-viral sequences. Optionally, a 
recombinant virus may cany AAVS rep sequences encoding one or more of the AAVS 
rep proteins. 

A. Delivery of Viruses 

25 In another aspect, the present invention provides a method for delivery of 

a transgene to a host which involves transfecting or infecting a selected host cell with a 
recombinant viral vector generated with the AAVS sequences (or functional fragments 
thereof) of the invention. Methods for delivery are well known to those of skill in the art 
and are not a limitation of the present invention. 

30 In one desirable embodiment, the invention provides a method for AAV8 

mediated delivery of a transgene to a host. This method involves transfecting or infecting 
a selected host cell with a recombinant viral vector containing a selected transgene under 
the control of sequences which direct expression thereof and AAVS capsid proteins. 



22 



WO 03/052051 



PCT/US02/33630 



Optionally, a sample from the host may be first assayed for the presence 
of antibodies to a selected AAV serotype. A variety of assay formats for detecting 
neutralizing antibodies are well known to those of skill in the art. The selection of such 
an assay is not a limitation of the present invention. See, e.g., Fisher et al, Nature Med, 
5 3(3):306-3-l~2 (March 1997) and W. C. Manning et al, Human Gene Therapy, 9:477-485 
(March 1, 1998). The results of this assay may be used to determine which AAV vector 
containing capsid proteins of a particular serotype are preferred for delivery, e.g., by the 
absence of neutralizing antibodies specific for that capsid serotype. 

In one aspect of this method, the delivery of vector with AAV8 capsid 

10 proteins may precede or follow delivery of a gene via a vector with a different serotype 
AAV capsid protein. Thus, gene delivery via rAAV vectors may be used for repeat gene 
delivery to a selected host cell. Desirably, subsequently administered rAAV vectors carry 
the same transgene as the first rAAV vector, but the subsequently administered vectors 
contain capsid proteins of serotypes which differ from the first vector. For example, if a 

15 first vector has AAV8 capsid proteins, subsequently administered vectors may have 
capsid proteins selected from among the other serotypes. 

The above-described recombinant vectors may be delivered to host cells 
according to published methods. The rAAV, preferably suspended in a physiologically 
compatible carrier, may be administered to a human or non-human mammalian patient. 

20 Suitable carriers may be readily selected by one. of skill in the art in view of the indication 
for which the transfer virus is directed. For example, one suitable carrier includes saline, 
which may be formulated with a variety of buffering solutions (e.g., phosphate buffered 
saline). Other exemplary carriers include sterile saline, lactose, sucrose, calcium 
phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, and water. The selection 

25 of the carrier is not a limitation of the present invention. 

Optionally, the compositions of the invention may contain, in addition to 
the rAAV and carrier(s), other conventional pharmaceutical ingredients, such as 
preservatives, or chemical stabilizers. Suitable exemplary preservatives include 
chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens, 

30 ethyl vanillin, glycerin, phenol, and parachlorophenol. Suitable chemical stabilizers 
include gelatin and albumin. 

The vectors are administered in sufficient amounts to transfect the cells 
and to provide sufficient levels of gene transfer and expression to provide a therapeutic 
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benefit without undue adverse effects, or with medically acceptable physiological effects, 
which can be determined by those skilled in the medical arts. Conventional and 
pharmaceutical^ acceptable routes of administration include, but are not limited to, direct 
delivery to a desired organ (e.g., the liver or lung), oral, inhalation, intranasal, 
5 intratracheal, intraocular, intravenous, intramuscular, subcutaneous, intradermal, and 
other parental routes of administration. Routes of administration may be combined, if 
desired. 

Dosages of the viral vector will depend primarily on factors such as the 
condition being treated, the age, weight and health of the patient, and may thus vary 

1 0 among patients. For example, a therapeutically effective human dosage of the viral vector 
is generally in the range of from about 1 ml to about 100 ml of solution containing 
concentrations of from about 1 x 10 9 to 1 x 10 16 genomes virus vector. A preferred 
human dosage may be about 1 x 10 13 to 1 x 10 16 AAV genomes. The dosage will be 
adjusted to balance the therapeutic benefit against any side effects and such dosages may 

1 5 vary depending upon the therapeutic application for which the recombinant vector is 
employed. The levels of expression of the transgene can be monitored to determine the 
frequency of dosage resulting in viral vectors, preferably AAV vectors containing the 
minigene. Optionally, dosage regimens similar to those described for therapeutic 
purposes may be utilized for immunization using the compositions of the invention. 

20 Examples of therapeutic products and immunogenic products for delivery 

by the AAV8-containing vectors of the invention are provided below. These vectors may 
be used for a variety of therapeutic or vaccinal regimens, as described herein. 
Additionally, these vectors may be delivered in combination with one or more other 
vectors or active ingredients in a desired therapeutic and/or vaccinal regimen. 

25 B. Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include hormones 
and growth and differentiation factors including, without limitation, insulin, glucagon, 
growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing factor 
(GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic 

30 gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, 
angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), 
connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic 
fibroblast growth factor (aFGF), epidermal growth factor (EGF), platelet-derived growth 



24 



WO 03/052051 



PCT/US02/33630 



factor (PDGF), insulin growth factors I and II (IGF-I and IGF-II), any one of the 
transforming growth factor a superfamily, including TGFa, activins, inhibins, or any of 
the bone morphogenic proteins (BMP) BMPs 1-15, any one of the 
heregluin/neuregulin/ARIA/neu differentiation factor (NDF) family of growth factors, 
5 nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins 
NT- 3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic 
factor (GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 
and netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and 
tyrosine hydroxylase. 

10 Other useful transgene products include proteins that regulate the immune 

system including, without limitation, cytokines and lymphokines such as thrombopoietin 
(TPO), interleukins (IL) IL-1 through IL-18, monocyte chemoattractant protein, leukemia 
inhibitory factor, granulocyte-macrophage colony stimulating factor, Fas ligand, tumor 
necrosis factors a and p, interferons a, p, and y, stem cell factor, flk-2/flt3 ligand. 

15 Gene products produced by the immune system are also useful in the invention. These 
include, without limitations, immunoglobulins IgG, IgM, IgA, IgD and IgE, chimeric 
immunoglobulins, humanized antibodies, single chain antibodies, T cell receptors, 
chimeric T cell receptors, single chain T cell receptors, class I and class II MHC 
molecules, as well as engineered immunoglobulins and MHC molecules. Useful gene 

20 products also include complement regulatory proteins such as complement regulatory 
proteins, membrane cofactor protein (MCP), decay accelerating factor (DAF), CR1, CF2 
and CD59. 

Still other useful gene products include any one of the receptors for the 
hormones, growth factors, cytokines, lymphokines, regulatory proteins and immune 

25 system proteins. The invention encompasses receptors for cholesterol regulation, 

including the low density lipoprotein (LDL) receptor, high density lipoprotein (HDL) 
receptor, the very low density lipoprotein (VLDL) receptor, and the scavenger receptor. 
The invention also encompasses gene products such as members of the steroid hormone 
receptor superfamily including glucocorticoid receptors and estrogen receptors, Vitamin 

30 D receptors and other nuclear receptors. In addition, useful gene products include 

transcription factors such as jun 9 fos, max, mad, serum response factor (SRF), AP-1, AP2, 
myb, MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, ATF1, ATF2, 
ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SP1, CCAAT-box binding proteins, 
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interferon regulation factor (IRF-1), Wilms tumor protein, ETS-binding protein, STAT, 
GATA-box binding proteins, e.g., GATA-3, and the forkhead family of winged helix 
proteins. 

Other useful gene products include, carbamoyl synthetase I, ornithine 
5 transcarbamylase, arginosuccinate synthetase, arginosuccinate- lyase, arginase, 

fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha-1 antitrypsin, glucose-6- 
phosphatase, porphobilinogen deaminase, factor VIII, factor IX, cystathione beta- 
synthase, branched chain ketoacid decarboxylase, albumin, isovaleryl-coA 
dehydrogenase, propionyl CoA carboxylase, methyl malonyl CoA mutase, glutaryl CoA 
1 0 dehydrogenase, insulin, beta-glucosidase, pyruvate carboxylate, hepatic phosphorylase, 
phosphorylase kinase, glycine decarboxylase, H-protein, T-protein, a cystic fibrosis 
transmembrane regulator (CFTR) sequence, and a dystrophin cDNA sequence. Still other 
useful gene products include enzymes such as may be useful in enzyme replacement 
therapy, which is useful in a variety of conditions resulting from deficient activity of 
1 5 enzyme. For example, enzymes that contain mannose-6-phosphate may be utilized in 
therapies for lysosomal storage diseases (e.g., a suitable gene includes that encoding p- 
glucuronidase (GUSB)). 

Other useful gene products include non-naturally occurring polypeptides, 
such as chimeric or hybrid polypeptides having a non-naturally occurring amino acid 
20 sequence containing insertions, deletions or amino acid substitutions. For example, 
single-chain engineered immunoglobulins could be useful in certain 
immunocompromised patients. Other types of non-naturally occurring gene sequences 
include antisense molecules and catalytic nucleic acids, such as ribozymes, which could 
be used to reduce overexpression of a target. 
25 Reduction and/or modulation of expression of a gene is particularly 

desirable for treatment of hyperproliferative conditions characterized by 
hyperproliferating ceils, as are cancers and psoriasis. Target polypeptides include those 
polypeptides which are produced exclusively or at higher levels in hyperproliferative cells 
as compared to normal cells. Target antigens include polypeptides encoded by oncogenes 
30 such as myb, myc, fyn, and the translocation gene bcr/abl, ras, src, P53, neu, trie and 
EGRF. In addition to oncogene products as target antigens, target polypeptides for 
anti-cancer treatments and protective regimens include variable regions of antibodies 
made by B cell lymphomas and variable regions of T cell receptors of T cell lymphomas 
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which, in some embodiments, are also used as target antigens for autoimmune disease. 
Other tumor-associated polypeptides can be used as target polypeptides such as 
polypeptides which are found at higher levels in tumor cells including the polypeptide 
recognized by monoclonal antibody 17-1 A and folate binding polypeptides. 
5 Other suitable therapeutic polypeptides and proteins include those which 

may be useful for treating individuals suffering from autoimmune diseases and disorders 
by conferring a broad based protective immune response against targets that are 
associated with autoimmunity including cell receptors and cells which produce "self- 
directed antibodies. T cell mediated autoimmune diseases include Rheumatoid arthritis 

10 (RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin dependent 
diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing 
spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's 
granulomatosis, Crohn's disease and ulcerative colitis. Each of these diseases is 
characterized by T cell receptors (TCRs) that bind to endogenous antigens and initiate the 

15 inflammatory cascade associated with autoimmune diseases. 
C. Immunogenic Transgenes 

Alternatively, or in addition, the vectors of the invention may contain 
AAV sequences of the invention and a transgene encoding a peptide, polypeptide or 
protein which induces an immune response to a selected immunogen. For example, 

20 immunogens may be selected from a variety of viral families. Example of desirable viral 
families against which an immune response would be desirable include, the picornavirus 
family, which includes the genera rhinoviruses, which are responsible for about 50% of 
cases of the common cold; the genera enteroviruses, which include polioviruses, 
coxsackieviruses, echoviruses, and human enteroviruses such as hepatitis A virus; and the 

25 genera apthoviruses, which are responsible for foot and mouth diseases, primarily in non- 
human animals. Within the picornavirus family of viruses, target antigens include the 
VP1, VP2, VP3, VP4, and VPG. Another viral family includes the calcivirus family, 
which encompasses the Norwalk group of viruses, which are an important causative agent 
of epidemic gastroenteritis. Still another viral family desirable for use in targeting 

30 antigens for inducing immune responses in humans and non-human animals is the 

togavirus family, which includes the genera alphavirus, which include Sindbis viruses, 
RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, 
including Rubella virus. The flaviviridae family includes dengue, yellow fever, Japanese 
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encephalitis, St. Louis encephalitis and tick borne encephalitis viruses. Other target 
antigens may be generated from the Hepatitis C or the coronavirus family, which includes 
a number of non-human viruses such as infectious bronchitis virus (poultry), porcine 
transmissible gastroenteric virus (pig), porcine hemagglutinatin encephalomyelitis virus 
5 (pig), feline infectious peritonitis virus (cats), -feline enteric coronavirus (cat), canine 
coronavirus (dog), and human respiratory coronaviruses, which may cause the common 
cold and/or non-A, B or C hepatitis. Within the coronavirus family, target antigens 
include the El (also called M or matrix protein), E2 (also called S or Spike protein), E3 
(also called HE or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), 
10 or N (nucleocapsid). Still other antigens may be targeted against the rhabdovirus family, 
which includes the genera vesiculovirus (e.g., Vesicular Stomatitis Virus), and the general 
lyssavirus (e.g., rabies). Within the rhabdovirus family, suitable antigens may be derived 
from the G protein or the N protein. The family filoviridae, which includes hemorrhagic 
fever viruses such as Marburg and Ebola virus may be a suitable source of antigens. The 
1 5 paramyxovirus family includes parainfluenza Virus Type 1 , parainfluenza Virus Type 3, 
bovine parainfluenza Virus Type 3, rubulavirus (mumps virus, parainfluenza Virus Type 
2, parainfluenza virus Type 4, Newcastle disease virus (chickens), rinderpest, 
morbillivirus, which includes measles and canine distemper, and pneumovirus, which 
includes respiratory syncytial virus. The influenza virus is classified within the family 
20 orthomyxovirus and is a suitable source of antigen (e.g., the HA protein, the Nl protein). 
The bunyavirus family includes the genera bunyavirus (California encephalitis, La 
Crosse), phlebovirus (Rift Valley Fever), hantavirus (puremala is a hemahagin fever 
virus), nairovirus (Nairobi sheep disease) and various unassigned bungaviruses. The 
arenavirus family provides a source of antigens against LCM and Lassa fever virus. The 
25 reovirus family includes the genera reovirus, rotavirus (which causes acute gastroenteritis 
in children), orbi viruses, and cultivirus (Colorado Tick fever, Lebombo (humans), equine 
encephalosis, blue tongue). The retrovirus family includes the sub-family oncorivirinal 
which encompasses such human and veterinary diseases as feline leukemia virus, HTLVI 
and HTLVII, lentivirinal (which includes HIV, simian immunodeficiency virus, feline 
30 immunodeficiency virus, equine infectious anemia virus, and spumavirinal). The 

papovavirus family includes the sub-family polyomaviruses (BKU and JCU viruses) and 
the sub-family papillomavirus (associated with cancers or malignant progression of 
papilloma). The adenovirus family includes viruses (EX, AD7, ARD, O.B.) which cause 
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respiratory disease and/or enteritis. The parvovirus family feline parvovirus (feline 
enteritis), feline panleucopeniavirus, canine parvovirus, and porcine parvovirus. The 
herpesvirus family includes the sub-family alphaherpesvirinae, which encompasses the 
genera simplexvirus (HSVI, HSVII), varicellovirus (pseudorabies, varicella zoster) and 
5 the sub-family betaherpesvirinae r which includes the genera cytomegalovirus (HCMV, — 
muromegalovirus) and the sub-family gammaherpesvirinae, which includes the genera 
lymphocryptovirus, EBV (Burkitts lymphoma), infectious rhinotracheitis, Marek's 
disease virus, and rhadinovirus. The poxvirus family includes the sub-family 
chordopoxvirinae, which encompasses the genera orthopoxvirus (Variola major 
10 (Smallpox) and Vaccinia (Cowpox)), parapoxvirus, avipoxvirus, capripoxvirus, 

leporipoxvirus, suipoxvirus, and the sub-family entomopoxvirinae. The hepadnavirus 
family includes the Hepatitis B virus. One unclassified virus which may be suitable 
source of antigens is the Hepatitis delta virus. Another virus which is a source of antigens 
is Nipan Virus. Still other viral sources may include avian infectious bursal disease virus 
15 and porcine respiratory and reproductive syndrome virus. The alphavirus family includes 
equine arteritis virus and various Encephalitis viruses. 

The present invention may also encompass immunogens which are useful 
to immunize a human or non-human animal against other pathogens including bacteria, 
fungi, parasitic microorganisms or multicellular parasites which infect human and non- 
20 human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial pathogens 
include pathogenic gram-positive cocci include pneumococci; staphylococci (and the 
toxins produced thereby, e.g., enterotoxin B); and streptococci. Pathogenic 
gram-negative cocci include meningococcus; gonococcus. Pathogenic enteric 
gram-negative bacilli include enterobacteriaceae; pseudomonas, acinetobacteria and 
25 eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; H. ducreyi (which 
causes chancroid); brucella species (brucellosis); Francisella tulavemis (which causes 
tularemia); Yersinia pestis (plague) and other yersinia (pasteurella); streptobacillus 
moniliformis and spirillum; Gram-positive bacilli include listeria monocytogenes; 
erysipelothrix rhusiopathiae; Coiymbacteriwn diphtheria (diphtheria); cholera; B. 
30 anthracis (anthrax); donovanosis (granuloma inguinale); and bartonellosis. Diseases 

caused by pathogenic anaerobic bacteria include tetanus; botulism (Clostriditm botulimtm 
and its toxin); Clostridium perfringem and its epsilon toxin; other Clostridia; 
tuberculosis; leprosy; and other mycobacteria. Pathogenic spirochetal diseases include 
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syphilis; treponematoses: yaws, pinta and endemic syphilis; and leptospirosis. Other 
infections caused by higher pathogen bacteria and pathogenic fungi include glanders 
{Burkholderia mallei); actinomycosis; nocardiosis; cryptococcosis, blastomycosis, 
histoplasmosis and coccidioidomycosis; candidiasis, aspergillosis, and mucormycosis; 
5 sporotrichosis; paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and 
chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever, 
Rocky Mountain spotted fever, Q fever (Coxiella burnetii), and Rickettsialpox. Examples 
of mycoplasma and chlamydial infections include: mycoplasma pneumoniae; 
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections. 
10 Pathogenic eukaryotes encompass pathogenic protozoans and helminths and infections 
produced thereby include: amebiasis; malaria; leishmaniasis; trypanosomiasis; 
toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii; babesiosis; 
giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes or flukes; and 
cestode (tapeworm) infections. 
15 Many of these organisms and/or the toxins produced thereby have been identified 

by the Centers for Disease Control. [(CDC), Department of Heath and Human Services, 
USA], as agents which have potential for use in biological attacks. For example, some of 
these biological agents, include, Bacillus anthracis (anthrax), Clostridium botulinum and 
its toxin (botulism), Yersinia pestis (plague), variola major (smallpox), Francisella 
20 tularensis (tularemia), and viral hemorrhagic fevers [filoviruses (e.g., Ebola, Marburg], 
and arenaviruses [e.g., Lassa, Machupo]), all of which are currently classified as Category 
A agents; Coxiella burnetii (Q fever); Brucella species (brucellosis), Burkholderia mallei 
(glanders), Burkholderia pseudomallei (meloidosis), Ricinus communis and its toxin 
(ricin toxin), Clostiidium perfringens and its toxin (epsilon toxin), Staphylococcus species 
25 and their toxins (enterotoxin B), Chlamydia psiitaci (psittacosis), water safety threats 

(e.g., Vibrio cholerae, Cryptosporidium parvum), Typhus fever (Richettsia powazekii), and 
viral encephalitis (alphaviruses, e.g., Venezuelan equine encephalitis; eastern equine 
encephalitis; western equine encephalitis); all of which are currently classified as 
Category B agents; and Nipan virus and hantaviruses, which are currently classified as 
30 Category C agents. In addition, other organisms, which are so classified or differently 
classified, may be identified and/or used for such a purpose in the future. It will be 
readily understood that the viral vectors and other constructs described herein are useful 
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to deliver antigens from these organisms, viruses, their toxins or other by-products, which 
will prevent and/or treat infection or other adverse reactions with these biological agents. 

Administration of the vectors of the invention to deliver immunogens 
against the variable region of the T cells elicit an immune response including CTLs to 
5 eliminate those~T cells. In rheumatoid arthritis (RA), several specific variable regions of 
TCRs which are involved in the disease have been characterized. These TCRs include 
V-3, V-14, V-17 and V-17. Thus, delivery of a nucleic acid sequence that encodes at 
least one of these polypeptides will elicit an immune response that will target T cells 
involved in RA. In multiple sclerosis (MS), several specific variable regions of TCRs 
10 which are involved in the disease have been characterized. These TCRs include V-7 and 
V-10. Thus, delivery of a nucleic acid sequence that encodes at least one of these 
polypeptides will elicit an immune response that will target T cells involved in MS. In 
scleroderma, several specific variable regions of TCRs which are involved in the disease 
have been characterized. These TCRs include V-6, V-8, V-14 and V-16, V-3C, V-7, 

15 V-14, V-15, V-16, V-28 and V-12. Thus, delivery of a nucleic acid molecule that 

encodes at least one of these polypeptides will elicit an immune response that will target 
T cells involved in scleroderma. 

Thus, a rAAV8-derived recombinant viral vector of the invention 
provides an efficient gene transfer vehicle which can deliver a selected transgene to a 

20 selected host cell in vivo or ex vivo even where the organism has neutralizing antibodies 
to one or more AAV serotypes. In one embodiment, the rAAV and the cells are mixed ex 
vivo; the infected cells are cultured using conventional methodologies; and the transduced 
cells are re-infused into the patient. 

These compositions are particularly well suited to gene delivery for 

25 therapeutic purposes and for immunization, including inducing protective immunity. 
Further, the compositions of the invention may also be used for production of a desired 
gene product in vitro. For in vitro production, a desired product (e.g., a protein) may be 
obtained from a desired culture following transfection of host cells with a rAAV 
containing the molecule encoding the desired product and culturing the cell culture under 

30 conditions which permit expression. The expressed product may then be purified and 
isolated, as desired. Suitable techniques for transfection, cell culturing, purification, and 
isolation are known to those of skill in the art. 
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The following examples illustrate several aspects and embodiments of the 
invention. 

EXAMPLES 

5 Example 1 : ProducTibn of Recombinant AAV8 Viral Genomes Equipped With AAV2 
ITRs 

Chimeric packaging constructs are generated by fusing AAV2 rep with cap 
sequences of novel AAV serotypes. These chimeric packaging constructs are used, 
initially, for pseudotyping recombinant AAV genomes carrying AAV2 ITRs by triple 
10 transfection in 293 cell using Ad5 helper plasmid. These pseudotyped vectors are used to 
evaluate performance in transduction-based serological studies and evaluate gene transfer 
efficiency of novel AAV serotypes in different animal models including NHP and 
rodents, before intact and infectious viruses of these novel serotypes are isolated. 

A. pAAV2GFP 

15 The AAV2 plasmid which contains the AAV2 ITRs and green fluorescent 

protein expressed under the control of a constitutive promoter. This plasmid contains the 
following elements: the AAV2 ITRs, a CMV promoter and the GFP coding sequences. 

B. Cloning of trans plasmid 
To construct the chimeric trans-plasmid for production of recombinant 

pseudotyped AAV8 vectors, p5El 8 plasmid (Xiao et al % 1999, J. Virol 73:3994-4003) 
was partially digested with Xho I to linearize the plasmid at the Xho I site at the position 
of 3169 bp only. The Xho I cut ends were then filled in and ligated back. This modified 
p5E18 plasmid was restricted with Xba I and Xho I in a complete digestion to remove the 
AAV2 cap gene sequence and replaced with a 2267 bp Spe I/Xho I fragment containing 
the AAV8 cap gene which was isolated- from pCRAAVS 6-5+15-4 plasmid. 

The resulting plasmid contains the AAV2 rep sequences for Rep78/68 
under the control of the AAV2 P5 promoter, and the AAV2 rep sequences for Rep52/40 
under the control of the AAV2 PI 9 promoter. The AAV9 capsid sequences are under the 
control of the AAV2 P40 promoter, which is located within the Rep sequences. This 
plasmid further contains a spacer 5' of the rep ORF. 

Alternatively, a similar plasmid can be constructed which utilizes the 
AAV8 rep sequences and the native AAV8 promoter sequences. This plasmid is then 
used for production of rAAV8, as described herein. 
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C. Production of Pseudotyped rAA V 

The rAAV particles (AAV2 vector in AAV8 capsid) are generated using 
an adenovirus-free method. Briefly, the cis plasmid (pAAV2.1 lacZ plasmid containing 
AAV2 ITRs), and the trans plasmid pCRAAVS 6-5+15-4 (containing the AAV2 rep and 
5 AAVScap) and a helper plasmid, respectively, were simultaneouslyco-transfected into 
293 cells in a ratio of 1 : 1 :2 by calcium phosphate precipitation. 

For the construction of the pAd helper plasmids, pBGl 0 plasmid was 
purchased from Microbix (Canada). A Rsrll fragment containing L2 and L3 was deleted 
from pBHGlO, resulting in the first helper plasmid, pAdAFl 3. Plasmid Ad Fl was 
10 constructed by cloning Asp700/Sall fragment with a Pmel/Sgfl deletion, isolating from 
pBHGlO, into Bluescript. MLP, L2 5 L2 and L3 were deleted in the pAdAFl. Further 
deletions of a 2.3 kb Nrul fragment and, subsequently, a 0.5 kb Rsrll/Nrul fragment 
generated helper plasmids pAdAF5 and pAdAF6, respectively. The helper plasmid, 
termed pAF6, provides the essential helper functions of E2a and E4 ORF6 not provided 
15 by the El-expressing helper cell, but is deleted of adenoviral capsid proteins and 
functional El regions). 

Typically, 50 \xg of DNA (cis:trans:helper) was transfected onto a 150 
mm tissue culture dish. The 293 cells were harvested 72 hours post-transfection, 
sonicated and treated with 0.5% sodium deoxycholate (37°C for 10 min.) Cell lysates 
20 were then subjected to two rounds of a CsCl gradient. Peak fractions containing rAAV 
vector are collected, pooled and dialyzed against PBS. 

Example 2 - Evaluation of Vectors with AAV8 Capsids 

Vectors based on AAV1 (2/1), AAV5 (2/5) and AAV2 (2/2) were developed 

25 essentially as described for AAV8 in Example L Genome copy (GC) titers of AAV 
vectors were determined by TaqMan analysis using probes and primers targeting S V40 
poly A region as described previously [Gao, G., et al., (2000) Hum Gene Ther 11, 2079- 
91]. Recombinant virions were recovered by CsC^ sedimentation in all cases except 
AAV2/2, which was purified by heparin chromatography. 

30 Vectors were constructed for each serotype for a number of in vitro and in vivo 

studies. Eight different transgene cassettes were incorporated into the vectors and 
recombinant virions were produced for each serotype. The recovery of virus, based on 
genome copies, is summarized in Table 1 . The yields of vector were high for each 

33 



WO 03/052051 



PCT/US02/33630 



are 



serotype with no consistent differences between serotypes. Data presented in the table 
average genome copy yields with standard deviation x 1 0 13 of multiple production lots of 
50 plate (150 mm) transfections. 

5 Table 1. Production of Recombinant Vectors 



CMV 
LacZ 


AAV2/1 


AAV2/2 


AAV2/5 


AAV2/8 


7.30 + 4.33 
(n=9) 


4.49 + 2.89 
(n=6) 


5.19 + 5.19 
(n=8) 


0.87 
(n=l) 


CMV 
EGFP 


6.43 + 2.42 
(n=2) 


3.39 + 2.42 
(n=2) 


5.55 + 6.49 
(n=4) 


3.74 + 3.88 
(n=2) 


TBG LacZ 


4.18 
(n-1) 


0.23 
(n=l) 


0.704 + 0.43 
(n=2) 


0.532 
(n-1) 


Alb A1AT 


4.67 + 0.75 
(n=2) 


4.77 
(n=l) 


4.09 
(n=l) 


2.02 
(n-1) 


CB A1AT 


0.567 
(n=l) 


0.438 
(n=l) 


2.82 
(n-D 


0.816 + 0.679 
(n=2) 


CMV 
rhCG 


8.78 + 2.37 
(n=7) 


1.43+1.18 
(n=2) 


1.63 + 1.15 
(n=3) 


1.32 + 0.87 
(n=3) 


TBG 
rhCG 


8.51+6.65 
(n=6) 


3.47 + 2.09 
(n=5) 


5.26 + 3.85 
(n=4) 


1.83 + 0.98 ' 
(n=5) 


TBG cFIX 


1.24+ 1.29 
(n=3) 


0.63 + 0.394 
(n=6) 


3.74 + 2.48 
(n=7) 


15.8 + 15.0 
(n=5) 



Example 3 - Serologic Analysis of Pseudotyped Vectors 

C57BL/6 mice were injected with vectors of different serotypes of AAVCBA1AT 

10 vectors intramuscularly (5 x 10" GC) and serum samples were collected 34 days later. To 
test neutralizing and cross-neutralizing activity of sera to each serotype of AAV, sera was 
analyzed in a transduction based neutralizing antibody assay [Gao, G. P., et al, (1996) J 
Virol 70, 8934-43]. More specifically, the presence of neutralizing antibodies was 
determined by assessing the ability of serum to inhibit transduction of 84-3 1 cells by 

15 reporter viruses (AAVCMVEGFP) of different serotypes. Specifically, the reporter virus 
AAVCMVEGFP of each serotype [at multiplicity of infection (MOi) that led to a 
transduction of 90% of indicator cells] was pre-incubated with heat-inactivated serum 
from animals that received different serotypes of AAV or from naive mice. After 1-hour 
incubation at 37° C, viruses were added to 84-31 cells in 96 well plates for 48 or 72- hour, 

20 depending on the virus serotype. Expression of GFP was measured by Fluorolmagin 
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(Molecular Dynamics) and quantified by Image Quant Software. Neutralizing antibody 
titers were reported as the highest serum dilution that inhibited transduction to less than 
50%. 

The availability of GFP expressing vectors simplified the development of an 
5 assay for neutralizing antibodies that was based on inhibition of transduction in a 

permissive cell line (i.e., 293 cells stably expressing E4 from Ad5). Sera to selected AAV 
serotypes were generated by intramuscular injection of the recombinant viruses. 
Neutralization of AAV transduction by 1 :20 and 1 :80 dilutions of the antisera was 
evaluated (Table 2). Antisera to AAV1, AAV2, AAV5 and AAV8 neutralized 
10 transduction of the serotype to which the antiserum was generated (AAV5 -and AAV8 to a 
lesser extent than AAV1 and AAV2) but not to the other serotype (i.e., there was no 
evidence of cross neutralization suggesting that AAV 8 is a truly unique serotype). 



Table 2. Serological Analysis of New AAV Serotypes. 





Serum dilution: 


Serum dilution: 


Serum dilution: 


Serum dilution: 


• Sera: 


Immunization Vector 


1/20 


1/80 


1/20 


1/80 


1/20 


1/80 


1/20 


1/80 


Group 1 


AAV2/1 


0 


0 


100 


100 


100 


100 


100 


100 


Group 2 


AAV2/2 


100 


100 


0 


0 


100 


100 


100 


100 


Group 3 


AAV2/5 


100 


100 


100 


100 


16.5 


16.5 


100 


100 


Group 4 


AAV2/8 


100 


100 


100 


100 


100 


100 


26.3 


60 



20 Human sera from 52 normal subjects were screened for neutralization against 

selected serotypes. No serum sample was found to neutralize AAV2/8 while AAV2/2 and 
AAV2/1 vectors were neutralized in 20% and 10% of sera, respectively. A fraction of 
human pooled IqG representing a collection of 60,000 individual samples did not 
neutralize AAV2/8, whereas AAV2/2 and AAV2/1 vectors were neutralized at titers of 

25 serum equal to 1/1280 and 1/640, respectively. 

Example 4 - In vivo Evaluation of Different Serotypes of AAV Vectors 

In this study, 7 recombinant AAV genomes, AAV2CBhAl AT, AAV2AlbhAl AT, 
AAV2CMVrhCG, AAV2TBGrhCG, AAV2TBGcFIX, AAV2CMVLacZ and 
30 AAV2TBGLacZ were packaged with capsid proteins of different serotypes. In all 7 
constructs, minigene cassettes were flanked with AAV2 ITRs. cDNAs of human ot- 
antitrypsin (A1AT) [Xiao, W, et al., (1999) J Virol 73, 3994-4003] (3-subunit of rhesus 
monkey choriogonadotropic hormone (CG) [Zoltick, P. W. & Wilson, J. M. (2000J Mol 
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Ther 2, 657-9] canine factor IX [Wang, L., et al., (1997) Proc Natl Acad Sci USA 94, 
1 1563-6] and bacterial P-glactosidase (i.e., Lac Z) genes were used as reporter genes. For 
liver-directed gene transfer, either mouse albumin gene promoter (Alb) [Xiao, W. (1999), 
cited above] or human thyroid hormone binding globulin gene promoter (TBG) [Wang 
5 (i"997), cited above] was used to drive liver specific expression of reporter genes. In 

muscle-directed gene transfer experiments, either cytomegalovirus early promoter (CMV) 
or chicken p-actin promoter with CMV enhancer (CB) was employed to direct expression 
of reporters. 

For muscle-directed gene transfer, vectors were injected into the right tibialis 

10 anterior of 4-6 week old NCR nude or C57BL/6 mice (Taconic, Germantown, NY) at a 
dose of 1 x 10 n genome copies (GC) per animal. In liver-directed gene transfer studies, 
vectors were infused intraportally into 7-9 week old NCR nude or C57BL/6 mice 
(Taconic, Germantown, NY), also at a dose of 1 x 10 1 1 genome copies (GC) per animal. 
Serum samples were collected intraorbitally at different time points after vector 

15 administration. Muscle and liver tissues were harvested at different time points for 
cryosectioning and Xgal histochemical staining from animals that received the lacZ 
vectors. For the re-administration experiment, C56BL/6 mice initially received AAV2/1, 
2/2, 2/5, 2/7 and 2/8CBA1AT vectors intramuscularly and followed for A 1 AT gene 
expression for 7 weeks. Animals were then treated with AAV2/8TBGcFIX intraportally 

20 and studied for cFIX gene expression. 

ELISA based assays were performed to quantify serum levels of hAlAT, rhCG 
and cFIX proteins as described previously [Gao, G. P., et al., (1996) J Virol 70, 8934-43; 
Zoltick, P. W. & Wilson, J. M. (2000) Mol Ther 2, 657-9; Wang, L., et al., Proc Natl 
Acad Sci U SA 94, 1 1563-6]. The experiments were completed when animals were 

25 sacrificed for harvest of muscle and liver tissues for DNA extraction and quantitative 
analysis of genome copies of vectors present in target tissues by TaqMan using the same 
set of primers and probe as in titration of vector preparations [Zhang, Y., et al., (2001) 
Mol Ther 3, 697-707]. 

The performance of vectors base on the new serotypes were evaluated in murine 

30 models of muscle and liver-directed gene transfer and compared to vectors based on the 
known serotypes AAV1, AAV2 and AAV5. Vectors expressing secreted proteins (A1AT 
and CG-Table 3) were used to quantitate relative transduction efficiencies between 
different serotypes through ELISA analysis of sera. The cellular distribution of 
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transduction within the target organ was evaluated using lacZ expressing vectors and X- 
gal histochemistry . 

The performance of AAV vectors in skeletal muscle was analyzed following 
direct injection into the tibialis anterior muscles. Vectors contained the same AAV2 
5 based genome with the immediate early gene of CMV or a CMV enhanced p-actin — 
promoter driving expression of the transgene. Previous studies indicated that immune 
competent C57BL/6 mice elicit limited humoral responses to the human Al AT protein 
when expressed from AAV vectors [Xiao, W., et al., (1999) J Virol 73, 3994-4003]. 

In each strain, AAV2/1 vector produced the highest levels of A1AT and AAV2/2 
10 vector the lowest, with AAV2/8 vectors showing intermediate levels of expression. Peak 
levels of CG at 28 days following injection of nu/nu NCR mice showed the highest levels 
from AAV2/7 and the lowest from AAV2/2 with AAV2/8 and AAV2/1 in between. 
Injection of AAV2/1 lacZ vectors yielded gene expression at the injection sites in all 
muscle fibers with substantially fewer lacZ positive fibers observed with AAV2/2 and 
15 AAV 2/8 vectors. 

Similar murine models were used to evaluate liver-directed gene transfer. 
Identical doses of vector based on genome copies were infused into the portal veins of 
mice that were analyzed subsequently for expression of the transgene. Each vector 
contained an AAV2 based genome using previously described liver-specific promoters 
20 (i.e., albumin or thyroid hormone binding globulin) to drive expression of the transgene. 
More particularly, CMVCG and TBGCG minigene cassettes were used for muscle and 
liver-directed gene transfer, respectively. Levels of rhCG were defined as relative units 
(rUs x 10 3 ). The data were from assaying serum samples collected at day 28, post vector 
administration (4 animals per group). As shown in Table 4, the impact of capsid proteins 
25 on the efficiency of transduction of Al AT vectors in nu/nu and C57BL/6 mice and CG 
vectors in C57BL/6 mice was consistent, i.e., AAV2/8 is the most efficient for 
pseudotype for liver-directed gene transfer. 
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Table 3. Expression of 0-unit of Rhesus Monkey Chorionic Gonadotropin 
(rhCG) in Mouse Muscle and Liver . 



5 Vector Muscle Liver 

AAV2/1 4.5 ± 2.1 1.6 ± 1.0 

AAV2 0.5 ± 0.1 0.7 ± 0.3 

AAV2/5 ND* 4.8 ± 0.8 

AAV2/8 4.0 ± 0.7 76.0 ± 22.8 
10 * Not determined in this experiment. 



In all cases, AAV2/8 vectors yielded the highest levels of transgene expression that 
ranged from 16 to 1 10 greater than what was obtained with AAV2/2 vectors; expression 
from AAV2/5 was intermediate. Analysis of X-Gal stained liver sections of animals that 

1 5 received the corresponding lacZ vectors showed a correlation between the number of 
transduced cells and overall levels of transgene expression. DNAs extracted from livers 
of C57BL/6 mice who received the Al AT vectors were analyzed for abundance of vector 
DNA using real time PCR technology. 

The amount of vector DNA found in liver 56 days after injection correlated with 

20 the levels of transgene expression (Table 4). For this experiment, a set of probe and 
primers targeting the SV40 polyA region of the vector genome was used for TaqMan 
PCR. Values shown are means of three individual animals with standard deviations. The 
animals were sacrificed at day 56 to harvest liver tissues for DNA extraction. These 
studies indicate that AA V8 is the most efficient vector for liver-directed gene transfer due 

25 to increased numbers of transduced hepatocytes. 
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Table 4. Real Time PCR Analysis for Abundance of AAV Vectors in nu/nu Mouse 
Liver Following Injection of lxlO 11 Genome Copies of Vector. 

AAV vectors/Dose Genome Copies per Cell 

5 

AAV2/lAlbAlAT 0.6 ± 0.36 

AAV2AlbAlAT 0.003 + 0.001 

10 AAV2/5AlbAlAT 0.83 ± 0.64 

AAV2/8AlbAlAT 18 ±11 

15 The serologic data described above suggest that AAV2/8 vector should not be 

neutralized in vivo following immunization with the other serotypes. C57BL/6 mice 
received intraportal injections of AAV2/8 vector expressing canine factor IX (10 11 
genome copies) 56 days after they received intramuscular injections of A 1 AT vectors of 
different serotypes. High levels of factor IX expression were obtained 14 days following 

20 infusion of AAV2/8 into naive animals (17+2 p,g/ml, N=4) which were not significantly 
different that what was observed in animals immunized with AAV2/1 (31+23 ng/ml, 
N=4), and AAV2/2 (16 fig/ml, N=2): This contrasts to what was observed in AAV2/8 
immunized animals that were infused with the AAV2/8 factor IX vector in which no 
detectable factor EX was observed (< 0.1 (ig/ml, N=4). 

25 Oligonucleotides to conserved regions of the cap gene did amplify sequences 

from rhesus monkeys that represented unique AAVs. Identical cap signature sequences 
were found in multiple tissues from rhesus monkeys derived from at least two different 
colonies. Full-length rep and cap open reading frames were isolated and sequenced from 
single sources. Only the cap open reading frames of the novel AAVs were necessary to 

30 evaluate their potential as vectors because vectors with the AAV8 capsids were generated 
using the ITRs and rep from AAV2. This also simplified the comparison of different 
vectors since the actual vector genome is identical between different vector serotypes. In 
fact, the yields of recombinant vectors generated using this approach did not differ 
between serotypes. 

35 Vectors based on AAVS appear to be immunologically distinct (i.e., they are not 

neutralized by antibodies generated against other serotypes). Furthermore, sera from 
humans do not neutralize transduction by AAVS vectors, which is a substantial advantage 
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over the human derived AAVs currently under development for which a significant 
proportion of the human population has pre-existing immunity that is neutralizing 
[Chirmule, N., et ah, (1999) Gene Ther 6, 1574-83]. 

The tropism of the new vector is favorable for in vivo applications. Importantly, 
5 AAV2/8 provides-a-substantial advantage over the other serotypes in terms ofefficiency 
of gene transfer to liver that until now has been relatively disappointing in terms of the 
numbers of hepatocytes stably transduced. AAV2/8 consistently achieved a 10 to 100- 
fold improvement in gene transfer efficiency as compared to the other vectors. The basis 
for the improved efficiency of AAV2/8 is unclear, although it presumably is due to 

10 uptake via a different receptor that is more active on the basolateral surface of 

hepatocytes. This improved efficiency will be quite useful in the development of liver- 
directed gene transfer where the number of transduced cells is critical, such as in urea 
cycle disorders and familial hypercholesterolemia. 

Thus, the lack of pre-existing immunity to AAVS and the favorable tropism of the 

15 vectors for liver indicates that vectors with AAVS capsid proteins are suitable for use as 
vectors in human gene therapy and other in vivo applications. 

Example 5 - Tissue Tropism Studies 

In the design of a high throughput functional screening scheme for novel AAV 

20 constructs, a non-tissue specific and highly active promoter, CB promoter (CMV 
enhanced chicken (3-actin promoter) was selected to drive an easily detectable and 
quantifiable reporter gene, human a-anti-trypsin gene. Thus only one vector for each new 
AAV clone needs to be made for gene transfer studies targeting 3 different tissues, liver, 
lung and muscle to screen for tissue tropism of a particular AAV construct. The 

25 following table summarizes data generated from novel AAV vectors in the tissue tropism 
studies (AAVCBA1 AT). Table 5 reports data obtained (in \ig Al AT/mL serum) at day 
14 of the study. 
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Table 5 



Vector 


Target Tissue 




Lung 


Liver 


Muscle 


AAV2/1 


ND 


ND 


45±11 


AAV2/5— 


0.6±0.2 


ND " 


ND 


AAV2/8 


ND 


84±30 


ND 



AAV vector carried CClOhAl AT minigene for lung specific expression were 
pseudotyped with capsids of novel AAVs were given to Immune deficient animals (NCR 
5 nude) in equal volume (50 \il each of the original preps without dilution) via intratracheal 
injections as provided in the following table. The vectors were also administered to 
immune competent animals (C57BL/6) in equal genome copies (lxl 0 n GC) as shown in 
the Table 6. (lxlO 11 GC per animal, C57BL/6, day 14, detection limit >0.033 jig/ml). As 
shown, AAVS is the best liver transducer. 
10 Table 6 



AAV Vector 


jig of AlAT/ml 
with lxl0 n vector 


2/1 


0.076±0.031 


2/2 


0.1±0.09 


2/5 


0.0840.033 


2/8 


1.92±].3 



Example 6 -Model of Hypercholesterolemia 
15 To further assess the affect of rAAV-mediated transgene expression by the 

AAV2/8 constructs of the invention, a further study was performed. 
A. Vector Construction 

AAV vectors packaged with AAVS capsid proteins were constructed 
using a pseudotyping strategy [Hildinger M, etai, J. Virol 2001; 75:6199-6203]. 
20 Recombinant AAV genomes with AAV2 inverted terminal repeats (ITR) were packaged 
by triple transfection of 293 cells with the c/>plasmid, the adenovirus helper plasmid and 
a chimeric packaging construct, a fusion of the capsids of the novel AAV serotypes with 
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the rep gene of AAV2. The chimeric packaging piasmid was constructed as previously 
described [Hildinger et al, cited above]. The recombinant vectors were purified by the 
standard CsCl 2 sedimentation method. To determine the yield TaqMan (Applied 
Biosystems) analysis was performed using probes and primers targeting the SV40 
5 poly(A) regioirof the vectors [Gao GP, et al, Hum Gene Ther. 2000 Ocrt0;ll(l 5):2079- 
91]. The resulting vectors express the transgene under the control of the human thyroid 
hormone binding globulin gene promoter (TBG). 
B. Animals 

LDL receptor deficient mice on the C57B1/6 background were purchased 

10 from the Jackson Laboratory (Bar Harbor, ME, USA) and maintained as a breeding 
colony. Mice were given unrestricted access to water and obtained a high fat Western 
Diet (high % cholesterol) starting three weeks prior vector injection. At day -7 as well at 
day 0, blood was obtained via retroorbital bleeds and the lipid profile evaluated. The mice 
were randomly divided into seven groups. The vector was injected via an intraportal 

15 injection as previously described ([Chen SJ et al, Mol Therapy 2000; 2(3), 256-261]. 
Briefly, the mice were anaesthetized with ketamine and xylazine. A laparotomy was 
performed and the portal vein exposed. Using a 30g needle the appropriate dose of vector 
diluted in 100 jxl PBS was directly injected into the portal vein. Pressure was applied to 
the injection site to ensure a stop of the bleeding. The skin wound was closed and draped 

20 and the mice carefully monitored for the following day. Weekly bleeds were performed 
starting at day 14 after liver directed gene transfer to measure blood lipids. Two animals 
of each group were sacrificed at the timepoints week 6 and week 12 after vector injection 
to examine atherosclerotic plaque size as well as receptor expression. The remaining mice 
were sacrificed at week 20 for plaque measurement and determination of transgene 

25 expression. 





Vector 


dose 




N 


Group 1 


AAV2/8-TBG-hLDLr 


lx 10 12 


gc 


12 


Group 2 


A A V2/8-TB G-hLDLr 


3x10" 


gc 


12 


Group 3 


AA V2/8-TB G-hLDLr 


lx 10" 


gc 


12 
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C. Serum lipoprotein and liver function. analysis 

Blood samples were obtained from the retroorbital plexus after a 6 hour 
fasting period. The serum was separated from the plasma by centrifugation. The amount 
of plasma lipoproteins and liver transaminases in the serum were detected using an 
5 automatized clinical chemistry analyzer (ACE, Schiapparelli Biosystems, Alpha 
Wassermann) 

D. Detection of transgene expression 

LDL receptor expression was evaluated by immuno-fluorescence staining 
and Western blotting. For Western Blot frozen liver tissue was homogenized with lysis 
10 buffer ( 20 mM Tris, pH 7.4, 130mM NaCI, 1% Triton X 100, proteinase inhibitor 
complete, EDTA-free, Roche, Mannheim, Germany). Protein concentration was 
determined using the Micro BCA Protein Assay Reagent Kit (Pierce, Rockford, IL). 40 
\ig of protein was resolved on 4- 15% Tris-HCl Ready Gels (Biorad, Hercules, CA) and 
transferred to a nitrocellulose membrane (Invitrogen). To generate anti-hLDL receptor 
15 antibodies a rabbit was injected intravenously with an AdhLDLr prep (lxlO 13 gc). Four 
weeks later the rabbit serum was obtained and used for Western Blot. A 1:100 dilution of 
the serum was used as a primary antibody followed by a HRP-conjugated anti-rabbit IgG 
and ECL chemiluminescent detection (ECL Western Blot Detection Kit, Amersham, 
Arlington Heights, IL). 
20 D. Immiinocytochemistry 

For determination of LDL receptor expression in frozen liver sections 
immunohistochemistry analyses were performed. lOum cryostat sections were either 
fixed in acetone for 5 minutes, or unfixed. Blocking was obtained via a 1 hour incubation 
period with 10% of goat serum. Sections were then incubated for one hour with the 
25 primary antibody at room temperature. A rabbit polyclonal antibody anti-human LDL 
(Biomedical Technologies Inc., Stoughton, MA) was used diluted accordingly to the ■ 
instructions of the manufacturer. The sections were washed with PBS, and incubated with 
1:100 diluted fluorescein goat anti-rabbit IgG (Sigma, St Louis, MO). Specimens were 
finally examined under fluorescence microscope Nikon Microphot-FXA. In all cases, 
30 each incubation was followed by extensive washing with PBS. Negative controls 

consisted of preincubation with PBS, omission of the primary antibody, and substitution 
of the primary antibody by an isotype-matched non-immune control antibody. The three 
types of controls mentioned above were performed for each experiment on the same day. 
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E. Gene transfer efficiency 

Liver tissue was obtained after sacrificing the mice at the designated time 
points. The tissue was shock frozen in liquid nitrogen and stored at -80°C until further 
processing. DNA was extracted from the liver tissue using a QIAamp DNA Mini Kit 
5 — (QIAGEN GmbH, Germany) according to the manufacturers protocol. Genome copies of 
AAV vectors in the liver tissue were evaluated using Taqman analysis using probes and 
primers against the SV40 poly(A) tail as described above. 

F. Atherosclerotic plaque measurement 

For the quantification of the atherosclerotic plaques in the mouse aorta the 
10 mice were anaesthetized (10% ketamine and xylazine, ip) 5 the chest opened and the 

arterial system perfused with ice-cold phosphate buffered saline through the left ventricle. 
The aorta was then carefully harvested, slit down along the ventral midline from the 
aortic arch down to the femoral arteries and fixed in formalin. The lipid-rich 
atherosclerotic plaques were stained with Sudan IV (Sigma, Germany) and the aorta 
1 5 pinned out flat on a black wax surface. The image was captured with a Sony DXC-960 
MD color video camera. The area of the plaque as well as of the complete aortic surface 
was determined using Phase 3 Imaging Systems (Media Cybernetics). 

G. Clearance of I ' 25 LDL 

Two animals per experimental group were tested. A bolus of 1 125 -labeled 
20 LDL (generously provided by Dan Rader, Upenn) was infused slowly through the tail 
vein over a period of 30 sec (1,000,000 counts of [1 125 ]-LDL diluted in 1 00j.il sterile 
PBS/ animal). At time points 3 min, 30 min, 1 .5 hr, 3 hr, 6 hr after injection a blood 
sample was obtained via the retro-orbital plexus. The plasma was separated off from the 
whole blood and 10p.l plasma counted in the gamma counter. Finally the fractional 
25 catabolic rate was calculated from the lipoprotein clearance data. 

H. Evaluation of Liver Lipid accumulation 

Oil Red Staining of frozen liver sections was performed to determine lipid 
accumulation. The frozen liver sections were briefly rinsed in distilled water followed by 
a 2 minute incubation in absolute propylene glycol. The sections were then stained in oil 
30 red solution (0.5% in propylene glycol) for 16 hours followed by counterstaining with 
Mayer's hematoxylin solution for 30 seconds and mounting in warmed glycerin jelly 
solution. 
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For quantification of the liver cholesterol and triglyceride content liver 
sections were homogenized and incubated in chloroform/methanol (2:1) overnight. After 
adding of 0.05% H 2 S0 4 and centrifugation for 10 minutes, the lower layer of each sample 
was collected, divided in two aliquots and dried under nitrogen. For the cholesterol 
5 measurement the dried lipids of the first aliquot were dissolved in 1 % Triton X-l 00 ih~ 
chloroform. Once dissolved, the solution was dried under nitrogen. After dissolving the 
lipids in ddH 2 0 and incubation for 30 minutes at 37°C the total cholesterol concentration 
was measured using a Total Cholesterol Kit (Wako Diagnostics). For the second aliquot 
the dried lipids were dissolved in alcoholic KOH and incubated at 60'C for 30 minutes. 
10 Then 1M MgCl 2 was added, followed by incubation on ice for 10 minutes and 

centrifugation at 14,000 rpm for 30 minutes. The supernatant was finally evaluated for 
triglycerides (Wako Diagnostics). 

All of the vectors pseudotyped in an AAV2/S capsid lowered total cholesterol, 
LDL and triglycerides as compared to the control. These test vectors also corrected 
15 phenotype of hypercholesterolemia in a dose-dependent manner. A reduction in plaque 
area for the AAV2/8 mice was observed in treated mice at the first test (2 months), and 
the effect was observed to persist over the length of the experiment (6 months). 

Example 7 - Functional Factor IX Expression and Correction of Hemophilia 

20 A. Knock-Out Mice 

Functional canine factor IX (FIX) expression was assessed in hemophilia 
B mice. Vectors with capsids of AAV1, AAV2, AAV5 or AAV8 were constructed to 
deliver AAV2 5' ITR - liver-specific promoter [LSP] - canine FIX - woodchuck hepatitis 
post-regulatory element (WPRE) - AAV2 3' ITR . The vectors were constructed as 

25 described in Wang et al, 2000, Molecular Therapy 2: 154-158), using the appropriate 
capsids. 

Knock-out mice were generated as described in Wang et al, 1997. Proc. 
Natl. Acad. Set USA 94: 1 1563-1 1566. This model closely mimic the phenotypes of 
hemophilia B in human. 

Vectors of different serotypes were delivered as a single intraportal 
injection into the liver of adult hemophiliac C57BI/6 mice in a dose of lxl 0 1 1 GC/mouse 
for the five different serotypes and a second AAV8 vector was also delivered at lxlO 10 
GC/mouse. Control group was injected with 1x10" GC of AAV2/8 TBG LacZ3. Each 
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group contains 5-10 male and female mice. Mice were bled bi-weekly after vector 
administration. 

1. ELISA 

The canine FIX concentration in the mouse plasma was 
5 determined by an ELISA assay-specific for canine factor IX, performed essentially as ~ 
described by Axelrod et al, 1 990, Proc.Natl.Acad.Sci. USA, 87:5 1 73-5 J 77 with 
modifications. Sheep anti-canine factor IX (Enzyme Research Laboratories) was used as 
primary antibody and rabbit anti-canine factor IX ((Enzyme Research Laboratories) was 
used as secondary antibody. Beginning at two weeks following injection, increased 
1 0 plasma levels of cFIX were detected for all test vectors. The increased levels were 

sustained at therapeutic levels throughout the length of the experiment, i.e., to 12 weeks. 
Therapeutic levels are considered to be 5% of normal levels, i.e., at about 250 ng/mL. 

The highest levels of expression were observed for the AAV2/8 (at 10"), 
with sustained superphysiology levels cFIX levels (ten-fold higher than the normal level). 
Expression levels for AAV2/8 (] 0 1 ') were approximately 1 0 fold higher than those 
observed for AAV2/2 and AAV2/8 (10 10 ) . The lowest expression levels, although still 
above the therapeutic range, were observed for AAV2/5. 

2. In Vitro Activated Partial Thromboplastin time (aPTT) Assay 
Functional factor IX activity in plasma of the FIX knock-out mice 
was determined by an in vitro activated partial thromboplastin time (aPTT) assay-Mouse 
blood samples were collected from the retro-orbital plexus into 1/10 volume of citrate 
buffer. APTT assay was performed as described by Wang et al, 1997, Proc. Natl. Acad. 
Sci. USA 94: 11563-11566. 

Clotting times by aPTT on plasma samples of all vector injected 
mice were within the normal range (approximately 60 sec) when measured at two weeks 
post-injection, and sustained clotting times in the normal or shorter than normal range 
throughout the study period (12 weeks). 

Lowest sustained clotting times were observed in the animals 
receiving AAV2/8 (10"). By week 12, AAV2/2 also induced clotting times similar to 
those for AAV2/8. However, this lowered clotting time was not observed for AAV2/2 
until week 12, whereas lowered clotting times (in the 25 - 40 sec range) were observed 
for AAV2/8 beginning at week two. 
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Immuno-histochemistry staining on the liver tissues harvested 
from some of the treated mice is currently being performed. About 70-80% of 
hepatocytes are stained positive for canine FIX in the mouse injected with AAV2/8.cFIX 
vector. 

5 B. —Hemophilia B Dogs — 

Dogs that have a point mutation in the catalytic domain of the F.IX gene, 
which, based on modeling studies, appears to render the protein unstable, suffer from 
hemophilia B [Evans et al, 1989, Proc. Natl. Acad. Sci. USA, 86:10095-10099). A 
colony of such dogs has been maintained for more than two decades at the University of 

10 North Carolina, Chapel Hill. The homeostatic parameters of these doges are well 

described and include the absence of plasma F.IX antigen, whole blood clotting times in 
excess of 60 minutes, whereas normal dogs are 6-8 minutes, and prolonged activated 
partial thromboplastin time of 50-80 seconds, whereas normal dogs are 13-28 seconds. 
These dogs experience recurrent spontaneous hemorrhages. Typically, significant 

15 bleeding episodes are successfully managed by the single intravenous infusion of 10 
ml/kg of normal canine plasma; occasionally, repeat infusions are required to control 
bleeding. 

Four dogs were injected intraportally with AAV.cFIX according to the 
schedule below. A first dog received a single injection with AAV2/2.cFIX at a dose of 

20 3.7x1 0 1 1 genome copies (GC)/kg and was sacrificed at day 665 due to severe spinal 

hemorrhage. A second dog received a first injection of AAV2/2.cFIX (2.8x1 0 11 GC/kg), 
followed by a second injection with AAV2/5.cFIX (2.3x1 0 13 GC/kg) at day 1 1 80. A third 
dog received a single injection with AAV2/2.cFIX at a dose of 4.6x1 0 12 GC/kg. The 
fourth dog received an injection with AAV2/2.cFIX (2.8xl0 12 GC/kg) and an injection at 

25 day 995 with AAV2/8.cFIX (5xl0 12 GC/kg). 

The abdomen of hemophilia dogs were aseptically and surgically opened 
under general anesthesia and a single infusion of vector was administered into the portal 
vein. The animals were protected from hemorrhage in the peri-operative period by 
intravenous administration of normal canine plasma. The dog was sedated, intubated to 

30 induce general anesthesia, and the abdomen was shaved and prepped. After the abdomen 
was opened, the spleen was moved into the operative field. The splenic vein was located 
and a suture was loosely placed proximal to a small distal incision in the vein. An 
introduced was rapidly inserted into the vein, then the suture loosened and a 5 F cannula 
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10 



15 



20 



was threaded to an intravenous location near the portal vein threaded to an intravenous 
location near the portal vein bifurcation. After hemostasis was secured and the catheter 
balloon was inflated, approximately 5.0 ml of vector diluted in PBS was infused into the 
portal vein over a 5 minute interval. The vector infusion was followed by a 5.0 ml 
infusion of saline.-The balloon was then deflated, the callula was removed antl venous 
hemostatis was secured. The spleen was then replaced, bleeding vessels were cauterized 
and the operative wound was closed. The animal was extubated having tolerated the 
surgical procedure well. Blood samples were analyzed as described. [Wang et al, 2000, 
Molecular Therapy 2 : 1 54- 1 5 8] 

The results are summarized in the table below. Dog C51, female, was 13.6 kg 
and 6.5 months old at the time of first injection. Dog C52, male, was 17.6 kg and 6.5 
months old at first injection; and 17.2 kg and 45.2 months at second injection. Dog C55, 
male, was a 19.0 kg and 12.0 months at first injection. Dog D39, female, was a 5.0 kg 
and 2.8 months at first injection; 22.6 kg and 35.4 months old at the time of the second 
injection. In the table, GC refers to genome copies of the AAV vectors. WBCT were > 
60 minutes (except C52 = 42 min) before injection. Baseline aPTT for C51 = 98.4 sec, 
C52 = 97.7 sec; C55 = 145.1 sec; D39 = 97.8 sec. Bleeds post-treatment were 
spontaneous bleeding episodes happening in hemophilia B dogs post-AAV vector 
treatment that required treatment with plasma infusion. 



Hemophilia B Dogs In jected with rAAV intraportally 





Dog 


Vector 


Vector 
Dose 
(GC/kg) 


Total GC 
Inject 


Avg 

WBCT 

(min) 


Avg aPTT 
(min) 


Avg cFIX 

plasma 

(ng/mL) 


1 st 

injection 


C51 


AAV2- 
LSP.cFIX 


3.7x10" 


5xl0 12 


13.2 
±2.1 


77.5 ±15.1 


3.8 ±1.0 




C52 


AAV2- 
LSP.cFIX 


2.8x10" 


5.0x1 0 12 


16.1 ±3.5 


81.5 ± 
17.7 


3.7± 1.1 




C55 


AAV2- 

LSP.cFIX 

WPRE 


4.6xl0 12 


8.7xl0 13 


10.2±2.2 


46.4±6.1 


259.7±28.5 




D39 


AAV2- 

LSPcFDC 

WPRE 


2.8xl0 12 


1.4xl0 13 


11.5±2.6 


59.1±6.3 


34.4±9.8 . 
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Hemophilia B Dog 


;s Injected with rAAV intraportallv 




Dog 


Vector 


Vector 

Dose 

(GC/kg) 


Total GC 
Inject 


Avg 

WBCT 

(min) 


Avg aPTT 
(min) 


AvgcFIX 

plasma 

(ng/mL) 


injection 




A A V9/S- 
LSP.cFIX 
WPRE 


Z.JAl U 




1 9 Q-4-1 1 


41 Q4-0 7 


Q 1 7 ia_ 
Ol / J± 

102.1 


2 nd 

injection 


D39 


AAV2/8- 
LSP.cFIX 
WPRE 


5.0xl0 12 


l.lxlO 14 


12.6 ± 
1.5 




656.9 ± 
1.1 



1 . Whole Blood Clotting Time (WBCT) 

WBCT following injection with the AAV2/2 vectors were 
5 somewhat variable, ranging from about 6.5 min to 30 minutes. WBCT for a normal dog 
is 6-12 min. Sharp drops in WBCT were observed immediately upon injection with the 
AAV2/8 or AAV2/5 vectors The sharp drop was also observed in C55 injected with 
AAV2 (d2=9 min), and for C5 1 and C52, the early data point for WBCT were not 
checked. The sharp drop is believed to be due to the dog plasma infusion before and after 
10 the surgery. WBCT is an assay very sensitive to low level of FIX, it is not very sensitive 
to the actual level of FIX (aPTT is more relevant). 

2. aPTT Assay 

Clotting times by aPTT on plasma samples of all vector injected 
dogs were variable over the first approximately 700 days, at which time clotting times 

15 leveled in the normal range (40 - 60 sec, normal dog: 24-32 sec). A sharp drop into the 
normal range was observed following each of the second injections (AAV2/8 or 
AAV2/5). While clotting times were not sustained in the normal range, clotting times 
were reduced to levels below those observed prior to the second injection. 

For aPTT, normal dogs are 24-32 sec, and hemophilia B dogs are 

20 80-106 sec. For C51 and C52 who received low dose of AAV2.cFIX vector, average 
aPTT after treatment remain at 77.5 and 81.5 sec, not significantly different from 
hemophilia B dogs without treatment. Higher dose of AAV2 improved the average aPTT 
to 59.1 and 46.4 sec, respectively for D39 and C55. After the treatment of AAV2/5, the 
average aPTT for C52 improved significantly from 81.5 sec to 41.9 sec. And for D39, 

25 after the AAV2/8 treatment, the average aPTT improve from 59. 1 sec. 
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3. Canine Factor LX ELISA 

cFIX levels were detectable following the first set of injections, 
albeit below therapeutic levels. Following injection with AAV2/8 and AAV2/5, levels of 
cFIX rose spiked into the therapeutic range and then leveled off within the therapeutic 
5 range (normal is 5ug/ml in plasma, therapeutic level is 5% of normatlevel which is 250 
ng/ml). 

The first three weeks of WBCT, aPTT and cFIX antigen are 
affected by the dog plasma infusion before and after the surgery. It is hard to conclude 
the drop of clotting time or the rise of cFIX antigen level is due to the vector or the 

10 plasma infusion for the first 3 weeks. However, it is interesting to note that the quick and 
dramatic rise of cFIX antigen after 2/5 and 2/8 vector injection. This is unique to 
AAV2/5 and 2/8 injected dogs and could be attributed to AAV2/5 and 2/8 vectors rather 
than the normal dog plasma infusion, since all dogs received similar amount of normal 
dog plasma infusion for the surgery. Three days after AAV2/8 injection, the level of 

1 5 cFIX in the plasma of D39 reached 9.5 ug/ml and peaked at 1 0.4 ug/ml at day 6, twice as 
much as the normal level (5 ug/mi). The cFIX level gradually decreased to the average of 
817ng/ml (C52, AAV2/5) and 657 ng/ml (D39, AAV2/8). In C52, 3 days after injection 
of AAV2/5 vector, the cFIX level reached 2.6 ug/ml and peaked at 4.6 ng/ml at day 7. In 
C55, who received AAV2 vector at the dose similar to that of AAV2/8 injected to D39, 

20 peaked only at 2.2 ug/ml at day 3, then gradually dropped and maintained at 5% of 
normal level of cFIX. 

The doses of vector received by C55 (AAV2, 4.6x1 0 12 GC/kg) 
and the second injection in D39 (AAV2/8, 5x1 0 12 GC/kg) were very close. However, the 
cFIX expression levels raised in D39 by AAV2/8 vector (average 657-34=623 ng/ml, 

25 12.5% of normal level) was 2.5 fold higher than that in C55 (average 259 ng/ml, 5% of 
normal level). This suggests AAV2/8 is 2.5 fold more potent than AAV2 in dogs injected 
intraportally with similar dose of vectors. And in the same dog D39, the second injection 
of two fold higher dose of AAV2/8 dramatically increased the cFIX level from 0.7% to 
13.1%, 18.7 fold higher than the first injection. And in C52, the second injection of 

30 2.3xl0 13 GC/ml of AAV2/5 vector resulted in average 817 ng/ml (16.3% of normal level) 
of cFIX in the plasma. This was only marginally higher (1.3 fold) than the cFIX level 
raised in D39 by AAV2/8 (average 623 ng/ml, 12.5% of normal level,). However, the 
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dose of AAV2/5 injected in C52 was 4.6 fold higher than the dose of AAV2/8 injected in 
D39. This suggests that AAV2/8 vector is also more potent than AAV2/5 vector in dogs. 

The first injection of AAV2 vectors did not block the success of 
transduction by AAV2/5 and AAV2/8 vectors after the second injection in dogs. 
5 Readministration using a differentserotype of AAV vector can be used as an approach to ~~ 
treat animals or humans who have been previously exposed to AAV2 or treated with 
AAV2 vectors. 

Example 8 - Mouse Model of Liver Enzyme Disorder 

10 The AAV2/8 vector generated as described herein was studied for its efficiency in 

transferring the liver enzyme gene ornithine transcarbamylase (OTC) in an accepted 
animal model for OTC deficiency [X. Ye et al 3 Pediatric Research, 41(4):527-534 
(1997); X. Ye et al, 1 Biol Chem., 271(7):3639-3646 (Feb. 1996)]. The results of this 
experiment (data not shown) demonstrate that an AAV2/8 vector of the invention 

15 carrying the ornithine transcarbamylase (OTC) gene was observed to correct OTC 
deficiency. 

All publications cited in this specification are incorporated herein by 
reference. While the invention has been described with reference to particularly 
20 preferred embodiments, it will be appreciated that modifications can be made without 
departing from the spirit of the invention. 
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What is claimed is: 



1 . An isolated adeno-associated virus (AAV) comprising an AAV capsid 
having an amino acid sequence of AAV8, amino acids 1 to 737 of SEQ ID NO:2. 

2. The isolated AAV according to claim 2, wherein said virus has the 
nucleic acid sequence of SEQ ID NO: 1 . 

3 . The isolated AAV according to claim 1 , wherein said AAV further 
comprises a minigene having AAV inverted terminal repeats and a heterologous gene 
operably linked to regulatory sequences which direct its expression in a host cell. 

4. A protein comprising an AAV8 protein or a fragment thereof selected 
from the group consisting of: 

(a) an AAV8 capsid protein or fragment thereof, selected from the group 

consisting of: 

vpl capsid protein, amino acids (aa) 1 to 737; 
vp2 capsid protein, aa 138 to 737; 
vp3 capsid protein, aa 203 to 737; 

a fragment encompassing hypervariable region (HVR)l through 
12 or a smaller fragment thereof selected the group, consisting of: aa 146 to 152; aa 182 to 
187; aa 262 to 264; aa 263 to 266; aa 263 to 266; aa 381 to 383; 383 to 385; aa 450 to 
474; aa 451 to 475; aa 490 to 495; aa 491 to 496; aa500 to 504; aa 501 to 505; aa 514 to 
522; aa 533 to 554; aa 534 to 555; aa 581 to 594; aa 583 to 596; aa 658 to 667; aa 660 to 
669; and aa 705 to 719; aa 707 to 723; 

aa 24 to 42, aa 25 to 28; aa 81 to 85; aal33 to 165; aa 134 to 165; 
aa 137 to 143; aa 154 to 156; aa 194 to 208; aa 261 to 274; aa 262 to 274; aa 171 to 173; 
aa 185 to 198; aa 413 to 417; aa 449 to 478; aa 494 to 525; aa 534 to 571; aa 581 to 601; 
aa 660 to 671; aa 709 to 723; and 

aa 1 to 184, aa 199 to 259; aa 274 to 446; aa 603 to 659; aa 670 to 
706; aa 724 to 736; aa 1 85 to 198; aa 260 to 273; aa447 to 477; aa495 to 602; aa603 to 
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659; aa 660 to 669; and aa707 to 723, wherein the amino acid numbers are those of the 
AAV2 capsid, SEQ ID NO:4 and corresponding regions in the capsid of AAV8, SEQ ID 
NO:2; 

and 

(b) an AAV8 rep protein or fragment thereof selected from the group 

consisting of: 

aa 1 to 625; aa 1 to 102; aa 103 to 140; aa 141 to 173; aa 174 to 226; aa 
227 to 275; aa 276 to 374; aa 375 to 383; aa 384 to 446; aa 447 to 542; aa 543 to 555; and 
aa 556 to 625, of SEQ ID NO: 3. 

5. An artificial adeno-associated virus (AAV) capsid protein comprising one 
or more of the AAV8 capsid protein fragments according to claim 4a. 

6. A recombinant adeno-associated virus (AAV) comprising an artificial 
capsid according to claim 5. 

7. A molecule comprising a nucleic acid sequence encoding a protein 
according to claim 4. 

8. The molecule according to claim 7, wherein said nucleic acid sequence is 
selected from the group consisting of: 

vpl.nt 2121 to 4335; 
vp2, nt 2532 to 4335; and 
vp 3, nt 2730 to 4335 , 
wherein the nucleotides numbers are of.AAVS, SEQ ID NO:l. 

9. The molecule according to claim 7 or claim 8, wherein said molecule 
comprises an AAV sequence encoding an AAV capsid protein and a functional AAV 
rep protein. 

1 0. The molecule according to any of claims 7 to 9, wherein said nucleic 
acid sequence comprises a sequence selected from the group consisting of: nucleic 
acids 905 to 2104 of SEQ ID NO:l; nucleic acids 237 to 2104 of SEQ ID NO:l; 
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nucleic acids 905 to 2104 of SEQ ID NO:l; and nucleic acids 237 to 2104 of SEQ ID 
NO:l. 



1 1 • The molecule according to claim 9, wherein said molecule comprises 
a cap protein or a functional AAV rep gene from a serotype selected from the group 
consisting of AAV1, AAV2, AAV3, AAV4, AAV5 and AAV6. 

12. The molecule according to any of claims 7 to 1 1 , wherein said 
molecule is a plasmid. 

13. A method of generating a recombinant adeno-associated virus (AAV) 
comprising an AAV serotype capsid comprising the steps of culturing a host cell 
containing: (a) a molecule encoding an AAV capsid protein; (b) a functional rep gene; (c) 
a minigene comprising AAV inverted terminal repeats (ITRs) and a transgene; and (d) 
sufficient helper functions to permit packaging of the minigene into the AAV capsid 
protein, wherein said host cell comprises a molecule according to any of claims 7 to 12. 

14. A host cell transfected with an adeno-associated virus according to any 
of claims 1 to 3 or claim 6 or a molecule according to any of claims 7 to 12. 

15. A composition comprising an AAV according to any of claims 1 to 3 or 
claim 6, and a physiologically compatible carrier. 

16. A method of delivering a transgene to a cell, said method comprising the step 
of contacting the cell with an AAV according to any of claims 1 to 3 or claim 6, wherein 
said rAAV comprises the transgene. 



or 



17. Use of an adeno-associated virus according to any of claims 1 to 3 
claim 6 or a molecule according to any of claims 7 to 1 1 in preparing a medicament for 
delivery of a transgene to a cell. 
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FIG. 1A 
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FIG. 1B 
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FIG. 1C 
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tttttggcaa acaaaatget 


gecagagaca 


J / o U 


atgcggatta 


cagcgat gtc 


atgctcacca 


gcgaggaaga aatcaaaacc 


actaaccctg 


•DO /I A 


tggctacaga 


ggaatacggt 


atcgtggcag 


ataacttgea gcagcaaaac 


acggctcctc 


3300 


aaattggaac 


tgtcaacagc 


cagggggect 


tacceggtat ggtctggcag 


aacegggacg 


3960 


tgtacctgca 


gggtcccatc 


tgggecaaga 


ttcctcacac ggaeggcaac 


ttccacccgt 


4020 


ctccgctgat 


gggcggcttt 


ggectgaaac atcctccgcc tcagatcctg atcaagaaca 


4080 


cgcctgtacc 


tgcggatcct 


ccgaccacct 


tcaaccagtc aaagctgaac 


tctttcatca 


4140 


cgcaatacag 


caccggacag 


gtcagcgtgg 


aaattgaatg ggagctgcag aaggaaaaca 


4200 


gcaagcgctg 


gaaccccgag 


atccagtaca 


cctccaacta ctacaaatct 


acaagtgtgg 


4260 


actttgctgt 


taatacagaa 


ggcgtgtact 


ctgaaccccg ccccattggc acccgttacc 


4320 


tcacccgtaa 


vpl-3 stop polyA 
tctqtaattq cctgttaatc aataaaccgg ttgattcgtt tcagttgaac 


4380 


tttggtctct 


gcg 
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50 

QWWKLKPGPP PPKPAERHKD DSRGLVLPGY 
EWWDLKPGAP KPKANQQKQD NGRGLVLPGY 
EWWALKPGAP KPKANQQKQD DGRGLVLPGY 
EWWDLKPGAP KPKANQQKQD DGRGLVLPGY 
EWWALKPGVP QPKANQQHQD NRRGLVLPGY 
EWWDLKPGAP KPKANQQKQD DGRGLVLPGY 
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AAV 


2 


KYLGPFNGLD 


KGEPVNEADA 


AAV 


"7 


KYLGPFNGLD 


KGEPVNAADA 


AAV 


'8 


KYLGPFNGLD 


KGEPVNAADA 


AAV 


"l 


KYLGPFNGLD 


KGEPVNAADA 


AAV 


"3 


KYLGPGNGLD 


KGEPVNEADA 


AAV 


"9 


KYLGPFNGLD 


KGEPVNAADA 



100 

AALEHDKAYD RQLDSGDNPY LKYNHADAEF 
AALEHDKAYD QQLKAGDNPY LRYNHADAEF 
AALEHDKAYD QQLQAGDNPY LRYNHADAEF 
AALEHDKAYD QQLKAGDNPY LRYNHADAEF 
AALEHDKAYD QQLKAGDNPY LKYNHADAEF 
AALEHDKAYD QQLKAGDNPY LRYNHADAEF 







101 




AAV 


2 


QERLKEDTSF 


GGNLGRAVFQ 


AAV 


7 


QERLQEDTSF 


GGNLGRAVFQ 


AAV" 


"8 


QERLQEDTSF 


GGNLGRAVFQ 


AAV" 


1 


QERLQEDTSF 


GGNLGRAVFQ 


AAV" 


"3 


QERLQEDTSF 


GGNLGRAVFQ 


AAV~ 


_9 


QERLQEDTSF 


GGNLGRAVFQ 






151 




AAV 


2 


.VEPDSSSGT 


GKAGQQPARK 


AAV~ 


"7 


QRSPDSSTGI 


GKKGQQPARK 


AAV" 


"8 


QRSPDSSTGI 


GKKGQQPARK 


AAV" 


"l 


.QEPDSSSGI 


GKTGQQPAKK 


AAV" 


"3 


.QEPDSSSGV 


GKSGKQPARK 


AAV~ 


[9 


QE.PDSSSGI 


gKSSQQPAKK 






201 




AAV 


2 


TNTMATGSGA 


PMADNNEGAD 


AAV~ 


"7 


SGTVAAGGGA 


PMADNNEGAD 


AAV*" 


"8 


PNTMAAGGGA 


PMADNNEGAD 


AAV~ 


"l 


PTTMASGGGA 


PMADNNEGAD 


AAV" 


"3 


SNTMASGGGA 


PMADNNEGAD 


AAV_ 


"9 


PNTMASgGGA 


PMADNNEGAD 






251 




AAV 


2 


PTYNNHLYKQ 


ISSQS— GAS 


AAV 


'l 


PTYNNHLYKQ 


ISSETA-GST 


AAV 


'8 


PTYNNHLYKQ 


ISNGTSGGAT 


AAV 


"l 


PTYNNHLYKQ 


ISSAST.GAS 


AAV 


"3 


PTYNNHLYKQ 


ISSQS . . GAS 


AAV 


"9 


PTYNNHLYKQ 


ISNGTSGGST 



150 

AKKRVLEPLG LVEEPVKTAP GKKRPVEHSP 
AKKRVLEPLG LVEEGAKTAP AKKRPVEPSP 
AKKRVLEPLG LVEEGAKTAP GKKRPVEPSP 
AKKRVLEPLG LVEEGAKTAP GKKRPVEQSP 
AKKRILEPLG LVEEAAKTAP GKKGAVDQSP 
AKKRVLEPLG LVEEGA KTAP GKKRPVEQSP 



200 

RLNFGQTGDA DSVPDPQPLG QPPAAPSGLG 
RLNFGQTGDS ESVPDPQPLG EPPAAPSSVG 
RLNFGQTGDS ESVPDPQPLG EPPAAPSGVG 
RLNFGQTGDS ESVPDPQPLG EPPATPAAVG 
RLNFGQTGDS ESVPDPQPLG EPPAAPTSLG 
RLNFGQTGDS ESVPDPQPLG EPPEAPSGLG 



250 

GVGNSSGNWH CDSTWMGDRV ITTSTRTWAL 
GVGNASGNWH CDSTWLGDRV ITTSTRTWAL 
GVGSSSGNWH CDSTWLGDRV ITTSTRTWAL 
GVGNASGNWH CDSTWLGDRV ITTSTRTWAL 
GVGNSSGNWH CDSQWLGDRV ITTSTRTWAL 
GVGNSSGNWH CDSTWLGDRV ITTSTRTWAL 



300 

NDNHYFGYST PWGYFDFNRF HCHFSPRDWQ 
NDNTYFGYST PWGYFDFNRF HCHFSPRDWQ 
NDNTYFGYST PWGYFDFNRF HCHFSPRDWQ 
NDNHYFGYST PWGYFDFNRF HCHFSPRDWQ 
NDNHYFGYST PWGYFDFNRF HCHFSPRDWQ 
NDNTYFGYST PWGYFDFNRF HCHFSPRDWQ 
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301 

AAV_2 RLINNNWGFR PKRLNFKLFN 

AAV_7 RL I NNNWG FR PKKLRFKLFN 

AAV_8 RLINNNWGFR PKRLSFKLFN 

AAV_1 RLINNNWGFR PKRLNFKLFN 

AAV_3 RLINNNWGFR PKKLSFKLFN 

AAV 9 RLINNNWGFR PKRLNFKLFN 



350 

IQVKEVTQND GTTTIANNLT STVQVFTDSE 

IQVKEVTTND GVTTIANNLT STIQVFSDSE 

IQVKEVTQNE GTKTIANNLT STIQVFTDSE 

IQVKEVTTND GVTTIANNLT STVQVFSDSE 

IQVRGVTQND GTTTIANNLT STVQVFTDSE 

IQVKEVTTNE GTKTIANNLT STVQVFTDSE 



351. 

AAV_2 YQLPYVLGSA HQGCLPPFPA 

AAVJ7 YQLPYVLGSA HQGCLPPFPA 

AAV_8 YQLPYVLGSA HQGCLPPFPA 

AAV_1 YQLPYVLGSA HQGCLPPFPA 

AAV_3 YQLPYVLGSA HQGCLPPFPA 

AAV 9 YQLPYVLGSA HQGCLPPFPA 



400 

DVFMVPQYGY LTLNNGSQAV GRSSFYCLEY 
DVFMIPQYGY LTLNNGSQSV GRSSFYCLEY 
DVFMIPQYGY LTLNNGSQAV GRSSFYCLEY 
DVFMIPQYGY LTLNNGSQAV GRSSFYCLEY 
DVFMVPQYGY LTLNNGSQAV GRSSFYCLEY 
DVFMVPQYGY LTLNNGSQAL GRSSFYCLEY 



401 

AAV_2 FPSQMLRTGN NFTFSYTFED 

AAV_7 FPSQMLRTGN NFEFSYSFED 

AAV_8 FPSQMLRTGN NFQFTYTFED 

AAV_1 FPSQMLRTGN NFTFSYTFEE 

AAV_3 FPSQMLRTGN NFQFSYTFED 

AAV 9 FPSQMLRTGN NFQFSYTFED 



450 

VPFHSSYAHS QSLDRLMNPL IDQYLYYLSR 
VPFHSSYAHS QSLDRLMNPL IDQYLYYLAR 
VPFHSSYAHS QSLDRLMNPL IDQYLYYLSR 
VPFHSSYAHS QSLDRLMNPL IDQYLYYLNR 
VPFHSSYAHS QSLDRLMNPL IDQYLYYLNR 
VPFHSSYAHS QSLDRLMNPL IDQYLYYLVg 



451 

AAV_2 TNTPSG . TTT QSRLQFSQAG 

AAV_7 TQSNPGGTAG NRELQFYQGG 

AAV_8 TQTTGG . TAN TQTLGFSQGG 

AAVJ. TQ.NQSGSAQ NKDLLFSRGS 

AAV_3 TQGTTSGTTN QSRLLFSQAG 

AAV 9 TQ^TG . . TGG TQTLAFSQAG 



500 

ASDIRDQS RNWLPGPCYRQQ RVSKTSADNN 
PSTMAEQA KNWLPGPCFRQQ RVSKTLDQNN 
PNTMANQA KNWLPGPCYRQQ RVSTTTGQNN 
PAGMSVQP KNWLPGPCYRQQ RVSKTKT DNN 
PQSMSLQA RNWLPGPCYRQQ RLSKTANDNN 
PSSMANQA RNWVPGPCYRQQ RVSTTTNQ^M 



501 

AAV_2 NSEYSWTGAT KYHLNGRDSL 

AAV_7 NSNFAWTGAT KYHLNGRNSL 

AAV_8 NSNFAWTAGT KYHLNGRNSL 

AAV_1 NSNFTWTGAS KYNLNGRESI 

AAV_3 NSNFPWTAAS KYHLNGRDSL 

AAV 9 NSNFAWTGAA KFKLNGRDSL 



550 

VNPGPAMASH KDDEEKFFPQ SGVLIFGKQG 
VNPGVAMATH KDDEDRFFPS SGVLIFGKTG 
ANPGIAMATH KDDEERFFPS NGILIFGKQN 
INPGTAMASH KDDEDKFFPM SGVMIFGKES 
VNPGPAMASH KDDEEKFFPM HGNLIFGKEG 
MNPGVAMASH KDDEDRF££S SGVLIFGKQG 



551 
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SEKTNVDIEK 


VMITDEEEIR 
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"7 


ATNKTT-LEN 


VLMTNEEEIR 
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"3 
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AAV" 
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AGNDGVDYSQ VLITDEEEIK 


A 



600 

TNPVATEQY GSVSTNLQRG NRQAATADVN 
TNPVATEEY GIVSSNLQAA NTAAQTQVVN 
TNPVATEEY GIVADNLQQQ NTAPQIGTVN 
TNPVATERF GTVAVNFQSS STDPATGDVH 
TNPVATEQY GTVANNLQSS NTAPTTGTVN 
TNPVATEEY gAYAIgNQAA NTQAQTGLVH 
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601 

AAV_2 TQGVLPGMVW QDRDVYLQGP 
AAV_7 NQGALPGMVW QNRDVYLQGP 
AAV_&~~ SQGALPGMVW QNRDVYLQGP 
AAV_1 AMGALPGMVW QDRDVYLQGP 
AAV_3 HQGALPGMVW QDRDVYLQGP 
AAV_9 NQGVIPGMVW QNRDVYLQGP 



650 

IWAKIPHTDG HFHPSPLMGG FGLKHPPPQI 
IWAKIPHTDG NFHPSPLMGG FGLKHPPPQI 
IWAKIPHTDG N FH PS PLMGGHFGLKH P P PQ I 
IWAKIPHTDG HFHPSPLMGG FGLKNPPPQI 
IWAKIPHTDG HFHPSPLMGG FGLKHPPPQI 
IWAKIPHTDG NFHPSPLMGG FGLKHPPPQI 



651 700 

AAV_2 LIKNTPVPA NPSTTFSAAKF ASFITQYSTG QVSVEIEWEL QKENSKRWNP 

AAVJ7 LIKNTPVPA NPPEVFTPAKF ASFITQYSTG QVSVEIEWEL QKENSKRWNP 

AAV_8 LIKNTPVPA DPPTTFNQSKL NSFITQYSTG QVSVEIEWEL QKENSKRWNP 

AAV_1 LIKNTPVPA NPPAEFSATKF ASFITQYSTG QVSVEIEWEL QKENSKRWNP 

AAV_3 MIKNTPVPA NPPTTFSPAKF ASFITQYSTG QVSVEIEWEL QKENSKRWNP 

AAV_9 LIKNTPVPA DgPLTJTNQAgL NSFITQYSTG QVSVEIEWEL QKENSKRWNP 



701 739 

AAV_2 EIQYTSNYNK SVNVDFTVDT NGVYSEPRPI GTRYLTRNL 

AAV_7 EIQYTSNFEK QTGVDFAVDS QGVYSEPRPI GTRYLTRNL 

AAV_8 EIQYTSNYYK STSVDFAVNT EGVYSEPRPI GTRYLTRNL 

AAV_1 EVQYTSNYAK SANVDFTVDN NGLYTEPRPI GTRYLTRPL 

AAV_3 EIQYTSNYNK SVNVDFTVDT NGVYSEPRPI GTRYLTRNL 

AAV 9 EIQYTSNYYK STNVDFAVNT EGVYSEPRPI GTRYLTRNL 
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Met Pro Gly Phe Tyr Glu lie Val He Lys Val Pro Ser Asp Leu Asp 
1 5 10 15 



Glu His Leu Pro Gly lie Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 



Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Arg Asn Leu He 
35 40 45 



Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 



Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
65 * 70 75 80 



Gin Phe Glu Lys Gly Glu Ser Tyr Phe His Leu His Val Leu Val Glu 
85 90 95 



Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin He. 
100 105 110 



Arg Glu Lys Leu Gly Pro Asp His Leu Pro Ala Gly Ser Ser Pro Thr 
115 120 125 



Leu Pro Asn Trp Phe Ala Val Thr Lys Asp Ala Val Met Ala Pro Ala 
130 135 140 



Gly Gly Asn Lys Val Val Asp Glu Cys Tyr He Pro Asn Tyr Leu Leu 
145 150 155 160 



Pro Lys Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu 
165 170 175 



Tyr lie Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala 
180 185 190 



Gin His Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn 
195 200 205 



Leu Asn Pro Asn Ser Asp Ala Pro Val He Arg Ser Lys Thr Ser Ala 
210 215 220 



WO 03/05205 1 PCT/US02/33630 

Fig. 3B 

Arg Tyr Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly He Thr Ser 
225 230 235 " ' 240 



Glu Lys Gin Trp lie GlfTGlu Asp Gin Ala Ser Tyr He Ser Phe Asn 
2 *5 250 255 



Ala Ala Ser Asn Ser Arg Ser Gin He Lys Ala Ala Leu Asp Asn Ala 
260 265 270 



Gly Lys He Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly 
275 280 285 



Pro Ser Leu Pro Ala Asp He Thr Gin Asn Arg He Tyr Arg He Leu 
290 295 300 



Ala Leu Asn Gly Tyr Asp Pro Ala Tyr Ala Gly Ser Val Phe Leu Gly 
305 310 315 320 



Trp Ala Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly 
325 330 1 335 



Pro Ala Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala 
340 345 350 



Val Pro Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe 
355 360 365 



Asn Asp Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met 
370 375 380 



Thr Ala Lys Val Val Glu Ser Ala Lys Ala lie Leu Gly Gly Ser Lys 
385 390 395 ~ 400 



Val Arg Val Asp Gin Lys Cys Lys Ser Ser Ala Gin lie Asp Pro Thr 
405 410 415 



Pro Val He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly 
4 20 4 25 " 4 30 



Asn Ser Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe 
435 440 445 
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Fig. 3C 



Lys Phe Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr 
450 455 460 



Lys Gin Glu Val Lys Glu Phe Phe Arg Trp Ala Ser Asp His Val Thr 
465 470 475 480 



Glu Val Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Ser Lys Arg 
485 490 " " 495 



Pro Ala Pro Asp Asp Ala Asp Lys Ser Glu Pro Lys Arg Ala Cys Pro 
500 505 510 



Ser Val Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp 
515 520 525 



Phe Ala Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met Leu 
530 535 540 



Gin Met Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe 
545 550 " 555 560 



Asn lie Cys Phe Thr His Gly Val Arg Asp Cys Ser Glu Cys Phe Pro 
565 570 ~ 575 



Gly Val Ser Glu Ser Gin Pro Val Val Arg Lys Arg Thr Tyr Arg Lys 
580 585 590 



Leu Cys Ala lie His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys 
595 600 605 



Ser Ala Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu 
610 615 620 



Gin 
625 



o/o 
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<150> US 60/341,151 
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<212> DNA 

<213> adeno-associated virus serotype 8 

<400> 1 



cagagaggga 


gtggccaact 


ccatcactag 


gggtagcgcg aagcgcctcc 


cacgctgccg 


60 


cgtcagcgct 


gacgtaaatt 


acgtcatagg 


ggagtggtcc tgtattagct 


gtcacgtgag 


120 


tgcttttgcg 


gcattttgcg 


acaccacgtg 


gccatttgag gtatatatgg 


ccgagtgagc 


180 


gagcaggatc 


tccattttga 


ccgcgaaatt 


tgaacgagca gcagccatgc 


cgggcttcta 


240 


cgagatcgtg 


atcaaggtgc 


cgagcgacct 


ggacgagcac ctgccgggca 


tttctgactc 


300 


gtttgtgaac 


tgggtggccg 


agaaggaatg 


ggagctgccc ccggattctg 


acatggatcg 


360 


gaatctgatc 


gagcaggcac 


ccctgaccgt 


ggccgagaag ctgcagcgcg 


acttcctggt 


420 


ccaatggcgc 


cgcgtgagta 


aggccccgga 


ggccctcttc tttgttcagt 


tcgagaaggg 


480 


cgagagctac 


tttcacctgc 


acgttctggt 


cgagaccacg ggggtcaagt 


ccatggtgct 


540 


aggccgcttc 


ctgagtcaga 


ttcgggaaaa 


gcttggtcca gaccatctac 


ccgcggggtc 


600 


gagccccacc 


ttgcccaact 


ggttcgcggt 


gaccaaagac gcggtaatgg 


cgccggcggg 


660 


ggggaacaag 


gtggtggacg 


agtgctacat 


ccccaactac ctcctgccca 


agactcagcc 


720 


cgagctgcag 


tgggcgtgga 


ctaacatgga 


ggagtatata agcgcgtgct 


tgaacctggc 


780 
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cgagcgcaaa 


cggctcgtgg 


cgcagcacct 


gacccacgtc agccagacgc 


aggagcagaa 


840 


caaggagaat ctgaacccca attctgacgc gcccgtgatc aggtcaaaaa cctccgcgcg 


900 


ctatatggag ctggtcgggt ggctggtgga _ccgggg^atc acctccgaga agcagtggat 


960 


ccaggaggac 


caggcctcgt 


acatctcctt 

t*. v.* w w L- 


oaauy LL.y ll LUL.aaL L UU L. 


y y LULUCtLja L» 


1020 


caaggccgcg 


ct ggacaatg 


cccrcr ca a rra t* 


c ^ cr c\ C ci c t~ ct ar , r , aaat"pr«rt 
LauyyL.yL.Ly auLaaaLLLU 


c* ci c c* c*ct a r* 1 i~ a 


1080 


cctggtgggg 


ccctcgct gc 


ccciccfcra pa t* 


tacccagaac cgcatctacc 


gcatcctcgc 


1140 


tctcaacggc 


tacgaccctg 


ccfcaccrpcrrcr 


ctccgtcttt ctcggctggg 


ctcagaaaaa 


1200 


ottcaaaaaa 

\~ \J ^ ^ \-A \-X W. 


V_*» y V— » t4 fct, Vp<* V-^ W4 


t c t cr n c t~ rr t* t~ 


tggacccgcc accaccggca 


agaccaacat 


1260 


tacccraaacc 

^ \£ y ^* y 


atccrcccaccf 

c* v»> y w y 


pprft"rfr , r'r , t"'t" 


ctacggctgc gtcaactgga 


ccaatgagaa 


1320 


ctt t ccctt c 


aatcfattcrccr 

ci n y c* v» l» y <w y 


uuy amaua i— 


ggtgatctgg tgggaggagg 


gcaagatgac 


1380 


cfcrccaaacrt* c 


y ty y ay i« i_» i~y 


ccz\ a <*r rr i* - * a +■ 
Luaayy ( — L- ct l. 


tctcggcggc agcaaggtgc 


gcgtggacca 


1440 


aaagtgcaag 


t cgtccgccc 


acratccracpp 

u y u uvu u *^ w 


cacccccgtg atcgtcacct 


ccaacaccaa 


1500 


catgtgcgcc 


gtgattgacg 


ggaacagcac 


caccttcgag caccagcagc 


ctctccagga 


1560 


ccggatgttt 


aagt t cgaac 


tcaccccrcccr 

W U. ^# \_J 


tctggagcac gactttggca aggtgacaaa 


1620 


gcaaqaaatc 


aaagagttct 


tccgctgggc 


cagtgatcac gtgaccgagg 


tggcgcatga 


1680 


gttttacgtc 


agaaagggcg 


gagccagcaa 


aagacccgcc cccgatgacg 


cggataaaag 


1740 


cgagcccaag 


cgggcctgcc 


cctcagtcgc 


ggatccatcg acgtcagacg 


cggaaggagc 


1800 


tccggtggac 


tttgccgaca 


ggtaccaaaa 


caaatgttct cgtcacgcgg gcatgcttca 


1860 


gatgctgttt 


ccctgcaaaa 


cgtgcgagag 


aatgaatcag aatttcaaca 


tttgcttcac 


1920 


acacggggtc 


agagactgct 


cagagtgttt 


ccccggcgtg tcagaatctc 


aaccggtcgt 


1980 


cagaaagagg 


acgtatcgga 


aactctgtgc 


gattcatcat ctgctggggc 


gggctcccga 


2040 


gattgcttgc 


tcggcctgcg 


atctggtcaa 


cgtggacctg gatgactgtg 


tttctgagca 


2100 


ataaatgact 


taaaccaggt 


atggctgccg 


atggttatct tccagattgg 


ctcgaggaca 


2160 


acctctctga 


gggcattcgc gagtggtggg 


cgctgaaacc tggagccccg 


aagcccaaag 


2220 


ccaaccagca 


aaagcaggac 


gacggccggg 


gtctggtgct tcctggctac 


aagtacctcg 


2280 


gacccttcaa 


cggactcgac 


aagggggagc 


ccgtcaacgc ggcggacgca 


gcggccctcg 


2340 


agcacgacaa 


ggcctacgac 


cagcagctgc 


aggcgggtga caatccgtac 


ctgcggtata 


2400 


accacgccga 


cgccgagttt 


caggagcgtc 


tgcaagaaga tacgtctttt 


gggggcaacc 


2460 


tcgggcgagc agtcttccag gccaagaagc gggttctcga acctctcggt ctggttgagg 


2520 
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aaggcgctaa 


gacggctcct 


ggaaagaaga 


gaccggtaga 


gccatcaccc 


cagcgttctc 


2580 


cagactcctc 


tacgggcatc 


ggcaagaaag 


gccaacagcc 


cgccagaaaa 


agactcaatt 


2640 


ttggtcagac 


tggcgactca 


gagtcagttc 


cagaccctca 


acctctcgga 


gaacctccag 


2700 


cagcgccctc 


tggtgtggga 


cctaatacaa 


tggctgcagg 


cggtggcgca 


ccaatggcag 


2760 


acaataacga 


aggcgccgac 


ggagtgggta 


gttcctcggg 


aaattggcat 


tgcgattcca 


2820 


catggctggg 


cgacagagtc 


atcaccacca 


gcacccgaac 


ctgggccctg 


cccacctaca . 


2880 


acaaccacct 


ctacaagcaa 


atctccaacg 


ggacatcggg 


aggagccacc 


aacgacaaca 


2940 


cctacttcgg 


ctacagcacc 


ccctgggggt 


attttgactt 


taacagattc 


cactgccact 


3000 


tttcaccacg 


tgactggcag 


cgactcatca 


acaacaactg 


gggattccgg 


cccaagagac 


3060 


tcagcttcaa 


gctcttcaac 


atccaggtca 


aggaggtcac 


gcagaatgaa 


ggcaccaaga 


3120 


ccatcgccaa 


taacctcacc 


agcaccatcc 


aggtgtttac 


ggactcggag 


taccagctgc 


3180 


cgtacgttct 


cggctctgcc 


caccagggct 


gcctgcctcc 


gttcccggcg 


gacgtgttca 


3240 


tgattcccca 


gtacggctac 


ctaacactca 


acaacggtag 


tcaggccgtg 


ggacgctcct 


3300 


ccttctactg 


cctggaatac 


tttccttcgc 


agatgctgag 


aaccggcaac 


aacttccagt 


3360 


ttacttacac 


cttcgaggac 


gtgcctttcc 


acagcagcta 


cgcccacagc 


cagagcttgg 


3420 


accggctgat 


gaatcctctg 


attgaccagt 


acctgtacta 


cttgtctcgg 


actcaaacaa 


3480 


caggaggcac 


ggcaaatacg 


cagactctgg 


gcttcagcca 


aggtgggcct 


aatacaatgg 


3540 


ccaatcaggc 


aaagaactgg 


ctgccaggac 


cctgttaccg 


ccaacaacgc 


gtctcaacga 


3600 


caaccgggca 


aaacaacaat 


agcaactttg 


cctggactgc 


tgggaccaaa 


taccatctga 


3660 


atggaagaaa 


ttcattggct 


aatcctggca 


tcgctatggc 


aacacacaaa 


gacgacgagg 


3720 


agcgtttttt 


tcccagtaac 


gggatcctga 


tttttggcaa 


acaaaatgct 


gccagagaca 


3780 


atgcggatta 


cagcgatgtc 


atgctcacca 


gcgaggaaga 


aatcaaaacc 


actaaccctg 


3840 


tggctacaga 


ggaatacggt 


atcgtggcag 


ataacttgca 


gcagcaaaac 


acggctcctc 


3900 


aaattggaac 


tgtcaacagc 


cagggggcct 


tacccggtat 


ggtctggcag 


aaccgggacg 


3960 


tgtacctgca 


gggtcccatc 


tgggccaaga 


ttcctcacac 


ggacggcaac 


ttccacccgt 


4020 


ctccgctgat 


gggcggcttt 


ggcctgaaac 


atcctccgcc 


tcagatcctg 


atcaagaaca 


4080 


cgcctgtacc 


tgcggatcct 


ccgaccacct 


tcaaccagtc 


aaagctgaac 


tctttcatca 


4140 


cgcaatacag 


caccggacag 


gtcagcgtgg 


aaattgaatg 


ggagctgcag 


aaggaaaaca 


4200 
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gcaagcgctg gaaccccgag atccagtaca cctccaacta ctacaaatct acaagtgtgg 4260 

actttgctgt taatacagaa ggcgtgtact ctgaaccccg ccccattggc acccgttacc 4320 

tcacccgtaa tctgtaattg cctgttaatrL-aataaaccgg ttgattcgtt tcagttgaac 4380 

tttggtctct gcg 4393 

<210> 2 
<211> 738 
<212> PRT 

<213> capsid protein of adeno-associated virus serotype 8 
<400> 2 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 

Glu Gly lie Arg Glu Trp Trp Ala Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 " " 30 

Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 

Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 

Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 

Gin Gin Leu Gin Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 ~ 95 

Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 

Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 " ~ 125 

Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 

Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly lie 
145 150 155 160 

Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
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165 



170 



175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro 
, 180 185 190 



Pro Ala Ala Pro Ser Gly Val Gly Pro Asn Thr Met Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Ser 
210 215 220 



Ser Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 " 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 255 



Leu Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ala Thr Asn Asp 
260 265 ' " 270 



Asn Thr Tyr Phe.Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn 
275 280 285 



Arg Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn 
290 295 300 



Asn Asn Trp Gly Phe Arg Pro Lys Arg Leu Ser Phe Lys Leu Phe Asn 
305 310 315 320 



He Gin Val Lys Glu Val Thr Gin Asn Glu Gly Thr Lys Thr He Ala 
325 330 335 



Asn Asn Leu Thr Ser Thr He Gin Val Phe Thr Asp -Ser Glu Tyr Gin 
340 345 350 



Leu Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe 
355 360 ' ~ 365 



Pro Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn 
370 375 * 380 



Asn Gly Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr 
385 390 ~ " 395 ~ - 400 



Oprto ^ 
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Phe Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Thr Tyr 
405 " 410 415 



Thr Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser 
420 425 " 430 



Leu Asp Arg Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu 
435- 440 " 445 



Ser Arg Thr Gin Thr Thr Gly Gly Thr Ala Asn Thr Gin Thr Leu Gly 
450 . 455 460 



Phe Ser Gin Gly Gly Pro Asn Thr Met Ala Asn Gin Ala Lys Asn Trp 
465 ' 470 475 " 480 



Leu Pro Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Gly 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Ala Gly Thr Lys Tyr His 
500 • 505 510 



Leu Asn Gly Arg Asn Ser Leu Ala Asn Pro Gly lie Ala Met Ala Thr 
515 520 ^ 525 



His Lys Asp Asp Glu Glu Arg Phe Phe Pro Ser Asn Gly lie Leu lie 
530 535 540 



Phe Gly Lys Gin Asn Ala Ala Arg Asp Asn Ala Asp Tyr Ser Asp Val 
545 " " 550 " 555 ^ * 560 



Met Leu Thr Ser Glu Glu Glu lie Lys Thr Thr Asn Pro Val Ala Thr 
565 570 575 



Glu Glu Tyr Gly lie Val Ala Asp Asn Leu Gin Gin Gin Asn Thr Ala 
580 585 590 



Pro Gin He Gly Thr Val Asn Ser Gin Gly Ala Leu Pro Gly Met Val 
595 600 605 



Trp Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He 
610 615 620 
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Pro His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe 
625 630 635 640 



Gly Leu Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val 
645 650 655 



Pro Ala Asp Pro Pro Thr Thr Phe Asn Gin Ser Lys Leu Asn Ser Phe 
660 665 670 



He Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu 
675 680 685 



Leu Gin Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr 
690 695 700 . 



Ser Asn Tyr Tyr Lys Ser Thr Ser Val Asp Phe Ala Val Asn Thr Glu 
705 710 715 720 



Gly Val Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg 
725 730 735 



Asn Leu 



<210> 3 

<211> 625 

<212> PRT 

<213> rep protein of adeno-associated virus serotype 8 

<400> 3 

Met Pro Gly Phe Tyr Glu He Val He Lys Val Pro Ser Asp Leu Asp 
1 5 10 15 



Glu His Leu Pro Gly He Ser Asp Ser Phe Val Asn Trp Val Ala Glu 
20 25 30 



Lys Glu Trp Glu Leu Pro Pro Asp Ser Asp Met Asp Arg Asn Leu He 
35 40 45 



Glu Gin Ala Pro Leu Thr Val Ala Glu Lys Leu Gin Arg Asp Phe Leu 
50 55 60 



Val Gin Trp Arg Arg Val Ser Lys Ala Pro Glu Ala Leu Phe Phe Val 
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65 



70 



75 



80 



Gin Phe Glu Lys Gly Glu Sef Tyr Phe His Leu His Val Leu Val Glu 
85 90 -95 



Thr Thr Gly Val Lys Ser Met Val Leu Gly Arg Phe Leu Ser Gin lie 
100 105 110 



Arg Glu Lys Leu Gly Pro Asp His Leu Pro Ala Gly Ser Ser Pro Thr 
115 120 125 



Leu Pro Asn Trp Phe Ala Val Thr Lys Asp Ala Val Met Ala Pro Ala 
130 135 140 



Gly Gly Asn Lys Val Val Asp Glu Cys Tyr lie Pro Asn Tyr Leu Leu 
145 150 155 160 



Pro Lys Thr Gin Pro Glu Leu Gin Trp Ala Trp Thr Asn Met Glu Glu 
165 170 175 



Tyr lie Ser Ala Cys Leu Asn Leu Ala Glu Arg Lys Arg Leu Val Ala 
180 185 190 



Gin His Leu Thr His Val Ser Gin Thr Gin Glu Gin Asn Lys Glu Asn 
195 200 205 



Leu Asn Pro Asn Ser Asp Ala Pro Val lie Arg Ser Lys Thr Ser Ala 
210 215 220 



Arg Tyr Met Glu Leu Val Gly Trp Leu Val Asp Arg Gly lie Thr Ser 
225 230 235 240 



Glu Lys Gin Trp lie Gin Glu Asp Gin Ala Ser Tyr lie Ser Phe Asn 
245 * 250 " 255 



Ala Ala Ser Asn Ser Arg Ser Gin lie Lys Ala Ala Leu Asp Asn Ala 
260 265 " 270 



Gly Lys lie Met Ala Leu Thr Lys Ser Ala Pro Asp Tyr Leu Val Gly 
275 280 * 285 



Pro Ser Leu Pro Ala Asp lie Thr Gin Asn Arg lie Tyr Arg lie Leu 
290 295 300 
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Ala Leu Asn Gly Tyr Asp Pro Ala Tyr Ala Gly Ser- Val Phe Leu Gly 
305 310 " 315 320 



Trp Ala Gin Lys Lys Phe Gly Lys Arg Asn Thr lie Trp Leu Phe Gly 
325 ~ 330 335 



Pro Ala Thr Thr Gly Lys Thr Asn He Ala Glu Ala He Ala His Ala 
340 345 350 



Val Pro Phe Tyr Gly Cys Val Asn Trp Thr Asn Glu Asn Phe Pro Phe 
355 360 365 



Asn Asp Cys Val Asp Lys Met Val He Trp Trp Glu Glu Gly Lys Met 
370 " 375 380 



Thr Ala Lys Val Val Glu .Ser Ala Lys Ala He Leu Gly Gly Ser Lys 
385 390 395 400 



Val Arg Val Asp Gin Lys Cys Lys Ser Ser Ala Gin He Asp Pro Thr 
405 410 415 



Pro Val He Val Thr Ser Asn Thr Asn Met Cys Ala Val He Asp Gly 
420 425 430 



Asn Ser Thr Thr Phe Glu His Gin Gin Pro Leu Gin Asp Arg Met Phe 
435 440 445 



Lys Phe Glu Leu Thr Arg Arg Leu Glu His Asp Phe Gly Lys Val Thr 
450 455 460 



Lys Gin Glu Val Lys Glu Phe Phe Arg Trp Ala Ser Asp His Val Thr 
465 470 475 480 



Glu Val Ala His Glu Phe Tyr Val Arg Lys Gly Gly Ala Ser Lys Arg 
485 " 490 " 495 



Pro Ala Pro Asp Asp Ala Asp Lys Ser Glu Pro Lys Arg Ala Cys Pro 
500 ^ ^ .505 510 



Ser Val Ala Asp Pro Ser Thr Ser Asp Ala Glu Gly Ala Pro Val Asp 
515 520 525 
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Phe Ala Asp Arg Tyr Gin Asn Lys Cys Ser Arg His Ala Gly Met Leu 
530 " " 535 540 



Gin Met Leu Phe Pro Cys Lys Thr Cys Glu Arg Met Asn Gin Asn Phe 
545 550 " * 555 560 



Asn lie Cys Phe Thr His Gly Val Arg Asp Cys Ser Glu Cys Phe Pro 
565 570 • 575 



Gly Val Ser Glu Ser Gin Pro Val Val Arg Lys Arg Thr Tyr Arg Lys 
580 585 590 



Leu Cys Ala lie His His Leu Leu Gly Arg Ala Pro Glu lie Ala Cys 
595 600 605 



Ser Ala Cys Asp Leu Val Asn Val Asp Leu Asp Asp Cys Val Ser Glu 
610 " 615 620 



Gin 
625 



<210> 4 

<211> 735 

<212> PRT 

<213> adeno-associated virus serotype 2 

<400> 4 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Thr Leu Ser 
1 5 10 15 



Glu Gly lie Arg Gin Trp Trp Lys Leu Lys Pro Gly Pro Pro Pro Pro 
20 25 30 



Lys Pro Ala Glu Arg His Lys Asp Asp Ser Arg Gly Leu Val Leu Pro 
35 40 " 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 " 55 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 ^ 80 



Arg Gin Leu Asp Ser Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
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85 



90 



95 



Asp Ala Glu Phe Gin Glu Arg Leu Lys Glu Asp Thr Ser Phe Gly Gly 
100 ~ 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Pro Val Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu His Ser Pro Val Glu Pro Asp Ser Ser Ser Gly Thr Gly 
145 150 155 160 



Lys Ala Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ala Asp Ser Val Pro Asp Pro Gin Pro Leu Gly Gin Pro Pro 
180 185 190 



Ala Ala Pro Ser Gly. Leu Gly Thr Asn Thr Met Ala Thr Gly Ser Gly 
195 * 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Met Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 ~ 250 255 



Tyr Lys Gin lie Ser Ser Gin Ser Gly Ala Ser Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn lie Gin Val 
305 310 315 320 



Parro 1 1 



WO 03/052051 



PCT/US02/33630 



Lys Glu Val Thr Gin Asn Asp Gly Thr Thr Thr lie Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 365 



Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 380 



Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Ser Arg Thr 
435 440 445 



Asn Thr Pro Ser Gly Thr Thr Thr Gin Ser Arg Leu Gin Phe Ser Gin 
450 455 460 



Ala Gly Ala Ser Asp He Arg Asp Gin Ser Arg Asn Trp Leu Pro Gly 
465 470 475 480 



Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Ser Ala Asp Asn Asn 
485 490 495 



Asn Ser Glu Tyr Ser Trp Thr Gly Ala Thr Lys Tyr His Leu Asn Gly 
500 505 510 



Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys Asp 
515 520 525 



Asp Glu Glu Lys Phe Phe Pro Gin Ser Gly Val Leu He Phe Gly Lys 
530 535 540 
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Gin Gly Ser Glu Lys Thr Asn Val Asp lie Glu Lys Val Met lie Thr 
545 " 550 555 560 



Asp Glu Glu Glu lie Arg Thr Thr Asn Pro Val Ala Thr Glu Gin Tyr 
565 570 575 



Gly Ser Val Ser Thr Asn Leu Gin Arg Gly Asn Arg Gin Ala Ala Thr 
580 585 590 



Ala Asp Val Asn Thr Gin Gly Val Leu Pro Gly Met Val Trp Gin Asp 
595 600 605 



Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His Thr 
610 615 620 



Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu Lys 
625 " 630 635 640 



His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala Asn 
645 650 655 



Pro Ser Thr Thr Phe Ser Ala Ala Lys Phe Ala Ser Phe He Thr Gin 
660 665 670 



Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin Lys 
675 ^ 680 685 



Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn Tyr 
690 695 700 



Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val Tyr 
705 ~ 710 715 720 



Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 5 
<211> 736 
<212> PRT 

<213> adeno-associated virus serotype 1 
<400> 5 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
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10 



15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105' 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 ~ 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Thr Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Ala Thr Pro Ala Ala Val Gly Pro Thr Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ala 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 ^ 230 235 240 
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Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin He Ser Ser Ala Ser Thr Gly Ala Ser Asn Asp Asn His 
260 • 265 270 



Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe 
275 280 285 



His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu He Asn Asn Asn 
290 295 300 



Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He Gin 
305 . 310 315 320 



Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn Asn 
325 330 335 



Leu Thr Ser Thr Val Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu Pro 
340 345 350 



Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala 
355 360 365 



Asp Val Phe Met lie Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly 
370 375 380 



Ser Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro 
385 ^ 390 395 400 



Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Thr Phe Ser Tyr Thr Phe 
405 410 415 



Glu Glu Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp 
420 425 430 



Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg 
435 440 445 



Thr Gin Asn Gin Ser Gly Ser Ala Gin Asn Lys Asp Leu Leu Phe Ser 
450 ^ 455 460 
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Arg Gly Ser Pro Ala Gly Met Ser Val Gin Pro Lys Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Lys Thr Lys Thr Asp Asn 
485 490 495 



Asn Asn Ser Asn Phe Thr Trp Thr Gly Ala Ser Lys Tyr Asn Leu Asn 
500 505 510 



Gly Arg Glu Ser lie lie Asn Pro Gly Thr Ala Met Ala Ser His Lys 
515 520 525 • 



Asp Asp Glu Asp Lys Phe Phe Pro Met Ser Gly Val Met lie Phe Gly 
530 535 540 



Lys Glu Ser Ala Gly Ala Ser Asn Thr Ala Leu Asp Asn Val Met lie 
545 "* 550 555 560 



Thr Asp Glu Glu Glu lie Lys Ala Thr Asn Pro Val Ala Thr Glu Arg 
565 570 575 



Phe Gly Thr Val Ala Val Asn Phe Gin Ser Ser Ser Thr Asp Pro Ala 
580 585 590 



Thr Gly Asp Val His Ala Met Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 



Asp Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
610 * 615 620 



Thr Asp Gly His Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 ~ 630 635 640 



Lys Asn Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asn Pro Pro Ala Glu Phe Ser Ala Thr Lys Phe Ala Ser Phe lie Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin 
675 680 685 
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Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu Val Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Ala Lys Ser Ala Asn Val. Asp Ehe Thr Val Asp Asn Asn Gly Leu 
705 ~ 710 715 720 



Tyr Thr Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Pro Leu 
725 730 735 



<210> 6 

<211> 736 

<212> PRT 

<213> adeno-associated virus serotype 3 

<400> 6 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 -5 10 15 



Glu Gly lie Arg Glu Trp Trp Ala Leu Lys Pro Gly Val Pro Gin Pro 
20 25 30 



Lys Ala Asn Gin Gin His Gin Asp Asn Arg Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Gly Asn Gly Leu Asp Lys Gly Glu Pro 
50 " " 55 " 60 



Val Asn Glu Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Lys Tyr Asn His Ala 
85 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg He Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Ala Ala Lys Thr Ala Pro Gly Lys Lys Gly 
130 135 140 



Ala Val Asp Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly Val Gly 
145 * 150 155 160 
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Lys Ser Gly Lys Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin Thr 
165 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 ' 190 



Ala Ala Pro Thr Ser Leu Gly Ser Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 ~ 220 



Ser Gly Asn Trp His Cys Asp Ser Gin Trp Leu Gly Asp Arg Val lie 
225 230 235 " 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu. Pro Thr Tyr Asn Asn His Leu 
245 250 255 



Tyr Lys Gin lie Ser Ser Gin Ser Gly Ala Ser Asn Asp Asn His Tyr 
260 265 270 



Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg Phe His 
• 275 280 285 



Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn Asn Trp 
290 295 300 



Gly Phe Arg Pro Lys Lys Leu Ser Phe Lys Leu Phe Asn lie Gin Val 
305 310 315 320 



Arg Gly Val Thr Gin Asn Asp Gly Thr Thr Thr lie Ala Asn Asn Leu 
325 330 335 



Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu Pro Tyr 
340 345 350 



Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro Ala Asp 
355 360 " 365 



Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn Gly Ser 
370 375 ~ 380 
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Gin Ala Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe Pro Ser 
385 390 395 400 



Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr Phe Glu 
405 410 415 



Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu Asp Arg 
420 425 430 



Leu Met Asn Pro Leu lie Asp Gin Tyr Leu Tyr Tyr Leu Asn Arg Thr 
435 440 445 



Gin Gly Thr Thr Ser Gly Thr Thr Asn Gin Ser Arg Leu Leu Phe Ser 
450 ^ 455 460 



Gin Ala Gly Pro Gin Ser Met Ser Leu Gin Ala Arg Asn Trp Leu Pro 
465 470 475 480 



Gly Pro Cys Tyr Arg Gin Gin. Arg Leu Ser Lys Thr Ala Asn Asp Asn 
485 490 495 



Asn Asn Ser Asn Phe Pro Trp Thr Ala Ala Ser Lys Tyr His Leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Val Asn Pro Gly Pro Ala Met Ala Ser His Lys 
515 520 525 



Asp Asp Glu Glu Lys Phe Phe Pro Met His Gly Asn Leu lie Phe Gly 
530 " 535 540 



Lys Glu Gly Thr Thr Ala Ser Asn Ala Glu Leu Asp Asn Val Met lie 
545 550 -555 560 



Thr Asp Glu Glu Glu He Arg Thr Thr Asn Pro Val Ala Thr Glu Gin 
565 570 575 



Tyr Gly Thr Val Ala Asn Asn Leu Gin Ser Ser Asn Thr Ala Pro Thr 
580 585 590 



Thr Gly Thr Val Asn His Gin Gly Ala Leu Pro Gly Met Val Trp Gin 
595 600 605 
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Asp Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
610 " 615 620 



Thr Asp Gly: His Pfce His Pro Ser Pro Leu Met Gly Gly Phe^Gly Lau_ 
625 630 635 640 



Lys His Pro Pro Pro Gin lie Met lie Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asn Pro Pro Thr Thr Phe Ser Pro Ala Lys Phe Ala Ser Phe lie Thr 
660 665 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu lie Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Asn Lys Ser Val Asn Val Asp Phe Thr Val Asp Thr Asn Gly Val 
705 " 710 715 720 



Tyr Ser Glu Pro Arg Pro lie Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 7 

<211> 736 

<212> PRT 

<213> adeno-associated virus serotype 9 

<400> 7 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
1 5 10 15 



Glu Gly lie Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asp Gly Arg Gly Leu Val Leu Pro 
35 " 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 ~ * ~ 55 . 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 80 
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Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 ~ 90 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Gly Lys Lys Arg 
130 135 140 



Pro Val Glu Gin Ser Pro Gin Glu Pro Asp Ser Ser Ser Gly lie Gly 
145 150 155 160 



Lys Ser Gly Gin Gin Pro Ala Lys Lys Arg Leu Asn Phe Gly Gin Thr 
165 ~ 170 175 



Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro Pro 
180 185 190 



Glu Ala Pro Ser Gly Leu Gly Pro Asn Thr Met Ala Ser Gly Gly Gly 
195 200 205 



Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn Ser 
210 215 220 



Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val lie 
225 230 235 240 



Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His Leu 
245 * 250 255 



Tyr Lys Gin lie Ser Asn Gly Thr Ser Gly Gly Ser Thr Asn Asp Asn 
260 265 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 ' 280 "* " 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 
290 295 300 
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Asn Trp Gly Phe Arg Pro Lys Arg Leu Asn Phe Lys Leu Phe Asn He 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Glu Gly Thr Lys Thr He Ala Asn 
325 330 ~ 335 



Asn Leu Thr Ser Thr Val Gin Val Phe Thr Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 365 



Ala Asp Val Phe Met Val Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 380 



Gly Ser Gin Ala Leu Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 "* " 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Gin Phe Ser Tyr Thr 
405 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Val 
435 440 ' 445 



Arg Thr Gin Thr Thr Gly Thr Gly Gly Thr Gin Thr Leu Ala Phe Ser 
450 455 460 



Gin Ala Gly Pro Ser Ser Met Ala Asn Gin Ala Arg Asn Trp Val Pro 
465 470 . 475 480 



Gly Pro Cys Tyr Arg Gin Gin Arg Val Ser Thr Thr Thr Asn Gin Asn 
485 " 490 495 



Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Ala Lys Phe Lys Leu Asn 
500 505 510 



Gly Arg Asp Ser Leu Met Asn Pro Gly Val Ala Met Ala Ser His Lys 
515 520 525 
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Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie Phe Gly 
530 535 ~ 540 



Lys Gin Gly Ala Gly Asn Asp Gly Val Asp Tyr Ser Gin Val Leu lie 
545 -550 — 555 560 



Thr Asp Glu Glu Glu lie Lys Ala Thr Asn .Pro Val Ala Thr Glu Glu 
565 570 575 



Tyr Gly Ala Val Ala lie Asn Asn Gin Ala Ala Asn Thr Gin Ala Gin 
580 585 590 



Thr Gly Leu Val His Asn Gin Gly Val lie Pro Gly Met Val Trp Gin 
595 600 605 



Asn Arg Asp Val Tyr Leu Gin Gly Pro lie Trp Ala Lys lie Pro His 
610 615 620 



Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly Leu 
625 630 635 ~ 640 



Lys His Pro Pro Pro Gin lie Leu lie Lys Asn Thr Pro Val Pro Ala 
645 650 655 



Asp Pro Pro Leu Thr Phe Asn Gin Ala Lys Leu Asn Ser Phe lie Thr 
660 665 " 670 



Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu Gin 
675 680 685 



Lys Glu Asn Ser Lys Arg Trp Asn Pro. Glu He Gin Tyr Thr Ser Asn 
690 695 700 



Tyr Tyr Lys Ser Thr Asn Val Asp Phe Ala Val Asn Thr Glu Gly Val 
705 710 715 " 720 



Tyr Ser Glu Pro Arg Pro He Gly Thr Arg Tyr Leu Thr Arg Asn Leu 
725 730 735 



<210> 8 

<211> 737 

<212> PRT 

<213> adeno-associated virus serotpye 7 
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<400> 8 

Met Ala Ala Asp Gly Tyr Leu Pro Asp Trp Leu Glu Asp Asn Leu Ser 
15 10 15 



Glu Gly He Arg Glu Trp Trp Asp Leu Lys Pro Gly Ala Pro Lys Pro 
20 25 30 



Lys Ala Asn Gin Gin Lys Gin Asp Asn Gly Arg Gly Leu Val Leu Pro 
35 40 45 



Gly Tyr Lys Tyr Leu Gly Pro Phe Asn Gly Leu Asp Lys Gly Glu Pro 
50 55 60 



Val Asn Ala Ala Asp Ala Ala Ala Leu Glu His Asp Lys Ala Tyr Asp 
65 70 75 ~ 80 



Gin Gin Leu Lys Ala Gly Asp Asn Pro Tyr Leu Arg Tyr Asn His Ala 
85 90 J 95 



Asp Ala Glu Phe Gin Glu Arg Leu Gin Glu Asp Thr Ser Phe Gly Gly 
100 105 ~ 110 



Asn Leu Gly Arg Ala Val Phe Gin Ala Lys Lys Arg Val Leu Glu Pro 
115 120 125 



Leu Gly Leu Val Glu Glu Gly Ala Lys Thr Ala Pro Ala Lys Lys Arg 
130 135 140 



Pro Val Glu Pro Ser Pro Gin Arg Ser Pro Asp Ser Ser Thr Gly He 
145 150 155 160 



Gly Lys Lys Gly Gin Gin Pro Ala Arg Lys Arg Leu Asn Phe Gly Gin 
165 170 175 



Thr Gly Asp Ser Glu Ser Val Pro Asp Pro Gin Pro Leu Gly Glu Pro 
180 185 190 



Pro Ala Ala Pro Ser Ser Val Gly Ser Gly Thr Val Ala Ala Gly Gly 
195 200 205 



Gly Ala Pro Met Ala Asp Asn Asn Glu Gly Ala Asp Gly Val Gly Asn 
210 215 220 
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Ala Ser Gly Asn Trp His Cys Asp Ser Thr Trp Leu Gly Asp Arg Val 
225 230 235 240 



lie Thr Thr Ser Thr Arg Thr Trp Ala Leu Pro Thr Tyr Asn Asn His 
245 250 "* 255 



Leu Tyr Lys Gin lie Ser Ser Glu Thr Ala Gly Ser Thr Asn Asp Asn 
260 265 " 270 



Thr Tyr Phe Gly Tyr Ser Thr Pro Trp Gly Tyr Phe Asp Phe Asn Arg 
275 280 285 



Phe His Cys His Phe Ser Pro Arg Asp Trp Gin Arg Leu lie Asn Asn 
290 295 " * 300 



Asn Trp Gly Phe Arg Pro Lys Lys Leu Arg Phe Lys Leu Phe Asn lie 
305 310 315 320 



Gin Val Lys Glu Val Thr Thr Asn Asp Gly Val Thr Thr He Ala Asn 
325 330 335 



Asn Leu Thr Ser Thr He Gin Val Phe Ser Asp Ser Glu Tyr Gin Leu 
340 345 350 



Pro Tyr Val Leu Gly Ser Ala His Gin Gly Cys Leu Pro Pro Phe Pro 
355 360 " 365 



Ala Asp Val Phe Met He Pro Gin Tyr Gly Tyr Leu Thr Leu Asn Asn 
370 375 - 380 



Gly Ser Gin Ser Val Gly Arg Ser Ser Phe Tyr Cys Leu Glu Tyr Phe 
385 390 395 400 



Pro Ser Gin Met Leu Arg Thr Gly Asn Asn Phe Glu Phe Ser Tyr Ser 
405 " 410 415 



Phe Glu Asp Val Pro Phe His Ser Ser Tyr Ala His Ser Gin Ser Leu 
420 425 430 



Asp Arg Leu Met Asn Pro Leu He Asp Gin Tyr Leu Tyr Tyr Leu Ala 
435 440 445 
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Arg Thr Gin Ser Asn Pro Gly Gly Thr Ala Gly Asn Arg Glu Leu Gin 
450 455 460 



Phe Tyr Gin Gly Gly Pro Ser Thr Met Ala Glu Gin Ala Lys Asn Trp 
46"5~~ 470 475 480 



Leu Pro Gly Pro Cys Phe Arg Gin Gin Arg Val Ser Lys Thr Leu Asp 
485 490 495 



Gin Asn Asn Asn Ser Asn Phe Ala Trp Thr Gly Ala Thr Lys Tyr His 
500 505 J 510 



Leu Asn Gly Arg Asn Ser Leu Val Asn Pro Gly Val Ala Met Ala Thr 
515 520 525 



His Lys Asp Asp Glu Asp Arg Phe Phe Pro Ser Ser Gly Val Leu lie 
530 535 540 



Phe Gly Lys Thr Gly Ala Thr Asn Lys Thr Thr Leu Glu Asn Val Leu 
545 550 555 560 



Met Thr Asn Glu Glu Glu lie Arg Pro Thr Asn Pro Val Ala Thr Glu 
565 570 575 



Glu Tyr Gly He Val Ser Ser Asn Leu Gin Ala Ala Asn Thr Ala Ala 
580 585 590 



Gin Thr Gin Val Val Asn Asn Gin Gly Ala s Leu Pro Gly Met Val Trp 
595 600 605 



Gin Asn Arg Asp Val Tyr Leu Gin Gly Pro He Trp Ala Lys He Pro 
610 615 620 



His Thr Asp Gly Asn Phe His Pro Ser Pro Leu Met Gly Gly Phe Gly 
625 630 635 " 64 0 



Leu Lys His Pro Pro Pro Gin He Leu He Lys Asn Thr Pro Val Pro 
645 650 " 655 



Ala Asn Pro Pro Glu Val Phe Thr Pro Ala Lys Phe Ala Ser Phe He 
660 665 ~ 670 ' 



Thr Gin Tyr Ser Thr Gly Gin Val Ser Val Glu He Glu Trp Glu Leu 
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